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Introduction 


The  1997  edition  of  ALA-LC  Romanization  Tables  contains  54  romanization  and  transliteration 
schemes,  the  majority  of  which  have  also  appeared  in  the  Library  of  Congress  Cataloging  Service 
Bulletin.  These  tables  were  developed  for  use  when  the  consistent  transliteration  of  a  non-Roman 
(vernacular)  script  (e.g.,  Arabic  script)  into  the  Roman  alphabet  is  needed.  This  publication  covers  more 
than  150  languages  written  in  various  non-Roman  scripts. 

Transliteration  of  non-Roman  scripts  is  done  by  librarians,  archivists,  information  specialists, 
writers,  etc.  when  translation  into  English  or  some  other  Roman  script  language  is  not  desirable  or 
possible.  This  is  often  the  case  with  proper  names,  titles,  and  terms  for  which  no  appropriate  Roman 
script  equivalant  exists.  For  some  scripts  the  process  of  romanization  or  transliteration  involves  imparting 
characteristics  of  the  Roman  script  such  as  capitalization  and  word  division.  The  Thai  script  is  typical 
of  one  that  has  neither  of  these  characteristics.  (There  are  no  separate  upper  and  lower  case  forms  for 
letters  in  the  Thai  script  and  words  are  not  separated  by  spaces.)  ALA-LC  Romanization  Tables  provides 
special  guidelines  for  capitalization  and  word  division  for  certain  scripts. 


Development  of  ALA-LC  Romanization  Tables 

Romanization  schemes  have  been  developed  over  the  years  by  foreign  language  specialists  at 
Library  of  Congress,  often  with  the  help  of  scholars  and  experts  at  other  institutions.  This  work  is  done 
to  support  the  cataloging  of  foreign  language  materials.  Bibliographic  information  appearing  in 
vernacular  scripts  is  routinely  romanized  so  that  it  can  be  incorporated  into  the  Library’s  large  catalog 
and  shared  with  other  institutions.  Cataloging  information  in  the  vernacular  is  often  included  with  the 
romanization. 

As  tables  for  new  scripts  are  developed  they  are  reviewed  and  approved  by  the  American  Library 
Association.  This  provides  input  from  a  wider  group  of  potential  users.  The  development  of 
romanization  schemes  is  an  ongoing  process.  Periodically  new  romanization  schemes  are  developed  for 
scripts  for  which  no  scheme  exists.  Existing  schemes  are  occasionally  revised  to  reflect  better 
understanding  of  a  script  or  to  provide  an  improved  rendering  of  it  in  the  Roman  script. 


Changes  in  this  Edition 

The  tables  in  the  1997  edition  include  four  new  tables  that  cover  Azerbaijani,  Balinese,  Javanese, 
Madurese,  Malay,  and  Sundanese.  As  with  existing  tables,  information  has  been  formatted  for  legibility. 
Many  of  the  older  tables  have  been  changed  to  correct  typographical  errors  and  omissions. 


Special  Features  of  the  Tables 

For  each  non-Roman  script,  a  font  was  selected  that  presents  each  character  (i.e.,  alphabetic, 
syllabic,  ideographic,  etc.)  as  unambiguously  as  possible.  For  some  scripts  certain  characters  are 
stylized.  This  was  done  to  make  similar  characters  more  easily  distinguished  from  one  another. 
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The  organization  of  characters  belonging  to  each  script  follows  the  traditional  arrangement  of 
characters  for  the  script.  Often,  this  is  not  in  the  order  of  the  Roman  alphabet. 

For  scripts  where  certain  characters  and  character  modifiers  can  appear  in  combination  with  others 
(e.g.,  vowel  marks  written  above  or  below  other  characters),  an  unspecified  character  is  represented  by 
the  convention  (e.g.,  6  represents  the  acute  accent  which  can  appear  over  a  variety  of  alphabetic 
characters).  This  convention  is  only  used  for  characters  which  are  normally  written  in  combination  with 
other  characters.  These  characters  are  referred  to  as  character  modifiers  and  are  generally  considered 
non-spacing. 

Most  scripts  included  special  characters  that  are  used  in  addition  to  the  regular  letters  of  an 
alphabet,  syllabary,  or  system  of  ideographs.  These  special  characters  occupy  a  space  by  themselves 
(e.g.,  ’  apostrophe).  When  presented  in  one  of  these  romanization  tables,  they  are  not  shown  in 
conjunction  with  any  □ . 

The  romanization  shown  for  each  character  often  includes  one  or  more  special  characters  or 
character  modifiers.  Special  attention  should  be  paid  to  this  part  of  each  romanization  scheme.  For 
characters  that  might  be  misidentified,  a  special  legend  (e.g.,  (soft  sign))  is  included  in  parentheses 
following  the  character.  When  a  character  has  no  roman  value  this  is  indicated  by  the  presence  of  a 
hyphen  (-),  the  word  "omit" ,  or  "disregard"  in  the  romanization  table. 

Examples  have  been  included  of  vernacular  text  for  some  scripts.  These  examples  are  intended 
to  be  descriptive  of  the  rule  or  principle  presented. 

NOTE:  In  the  application  of  the  ALA-LC  Romanization  Tables ,  familiarity  with  the  language  involved 
is  helpful,  if  not  essential,  for  many  scripts. 
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Amharic 


Amharic 


Syllables 


1st  < 

Order 

2nd 

Order 

3rd 

Order 

4th 

Order 

5th 

Order 

6th 

Order 

7th 

Order 

U 

ha 

Ih 

hu 

Y. 

hi 

V 

ha 

ft 

he 

I) 

he  or  h 

If 

ho 

A 

la 

A* 

lu 

A. 

li 

A 

la 

A» 

le 

A 

le  or  1 

A” 

lo 

rh 

ha 

th* 

hu 

Ax. 

hi 

Ax 

ha 

• 

itu 

he 

Ax 

he  or  h 

#*» 

ho 

ao 

ma 

{JO* 

mu 

°X 

mi 

ma 

°% 

me 

9° 

me  or  m 

mo 

ip 

sa 

V* 

su 

*X 

/ , 

si 

*7 

sa 

se 

/•* 

se  or  s 

Y' 

so 

i. 

ra 

<* 

ru 

6 

ri 

ra 

re 

C 

re  or  r 

e 

ro 

A 

sa 

it 

su 

A. 

si 

A 

sa 

A> 

se 

A 

se  or  s 

A 

so 

n 

V 

sa 

if 

V 

su 

rf. 

V. 

si 

tf 

v — 

sa 

7C 

se 

Ti 

se  or  s 

V 

V 

so 

+ 

qa 

•k 

qu 

qi 

qa 

$ 

qe 

4* 

qe  or  q 

* 

qo 

n 

ba 

a* 

bu 

0. 

bi 

n 

ba 

n> 

be 

•fl 

be  or  b 

n 

bo 

ta 

*i* 

tu 

ti 

p 

ta 

te 

te  or  t 

+ 

to 

¥ 

V 

ca 

* 

V 

cu 

X 

Cl 

? 

V— 

ca 

* 

ce 

ce  or  c 

¥ 

V 

CO 

ha 

hu 

X 

hi 

P 

ha 

•% 

he 

he  or  h 

•ft* 

ho 

V 

na 

V- 

nu 

ni 

c 

na 

i 

ne 

ne  or  n 

ft* 

no 

7 

na 

*7 

iiu 

X 

ni 

na 

H 

ne 

ne  or  n 

no 

ft 

’a 

h* 

’u 

K 

’i 

h 

’a 

K 

’e 

A 

’e 

ft 

’o 

h 

ka 

ku 

K 

ki 

u 

ka 

\b 

ke 

h 

ke  or  k 

Y» 

ko 

* 

xa 

Ti- 

XU 

Ti. 

xi 

ft 

xa 

Ti. 

xe 

Ti 

xe  or  x 

Ti 

xo 

w 

wa 

<DL 

wu 

X 

wi 

<p 

wa 

•£ 

we 

«>* 

we  or  w 

f* 

wo 

0 

‘a 

a* 

‘u 

‘i 

ft 

‘a 

ft. 

‘e 

A 

‘e 

r 

‘o 

H 

za 

If 

zu 

it 

zi 

n 

za 

It 

ze 

ftl 

ze  or  z 

H 

zo 

7f 

V 

za 

TF 

V 

zu 

Tt 

v , 

Zl 

nr 

V  — 

za 

W 

ze 

Pf 

ze  or  z 

x 

zo 

% 

V 

za 

X 

V 

zu 

X 

V  . 

Zl 

PC 

za 

Pf 

ze 

X 

ze  or  z 

x 

V 

zo 

- 

- 

- 

- 

- 

ze  or  z 

- 

f 

ya 

yu 

pl 

yi 

? 

ya 

ft. 

ye 

A 

ye  or  y 

p- 

yo 

ft 

X 

da 

V 

ga 

ft* 

% 

du 

V 

gu 

ft. 

% 

di 

v , 

gi 

ft 

t 

da 

ga 

& 

£ 

de 

v  .. 
ge 

de  or  d 
ge  or  g 

do 

V 

go 

7 

ga 

7* 

gu 

X 

gi 

p 

ga 

% 

ge 

ge  or  g 

go 

m 

ta 

• 

m- 

tu 

m. 

ti 

• 

ta 

iJh 

te 

• 

T 

te  or  t 

#n 

to 

ax 

ca 

cu 

at 

•  • 

Cl 

**L 

ca 

mr 

•  * 
ce 

ce  or  c 

CO 

A 

pa 

A* 

pu 

& 

Pi 

ft 

Pa 

ft. 

pe 

ft* 

pe  or  p 

ft 

Ro 

ft 

sa 

ft* 

su 

ft. 

si 

ft 

sa 

SU 

se 

ft* 

se  or  s 

ft 

so 

0 

sa 

0* 

su 

t 

k 

f 

sa 

ft 

se 

A 

se  or  s 

io 

A 

fa 

fu 

& 

fi 

4* 

fa 

fe 

¥ 

fe  or  f 

G 

fo 

T 

pa 

pu 

X 

Pi 

T 

pa 

•c 

pe 

T 

pe  or  p 

po 

H 

va 

ft* 

vu 

a 

vi 

ft 

va 

ft 

ve 

.<T 

ve  or  v 

r7 

VO 

Combinations  with  w 

I* 

qwa 

- 

tfn. 

qwi 

* 

qwa 

£ 

qwe 

db^ 

qwe 

- 

> 

hwa 

- 

hwi 

X 

hwa 

hwe 

hwe 

- 

kwa 

- 

h«* 

kwi 

K 

kwa 

kwe 

YA 

kwe 

- 

> 

gwa 

- 

> 

gwi 

ft 

gwa 

> 

gwe 

r 

gwe 

- 
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Combinations  with  wa: 


lwa 

a. 

bwa 

H 

zwa 

OK 

twa 

mwa 

* 

twa 

TC 

zwa 

ax 

cwa 

Si  k 

rwa 

£ 

cwa 

¥ 

ywa 

swa 

swa 

V 

nwa 

& 

dwa 

AS  ¥ 

fwa 

swa 

X 

nwa 

& 

gwa 

- 

Combinations  with  ya: 

X  z: 

rya 

mya 

% 

fya 

Special  Initial: 

75 

a 

Numerals: 

&  l 

«r-»  A 

S  2 

£  3 

2  4 

§  5 

%  6 

%  7 

oo 

*KI 

l  10 

S'  20 

0  30 

9  40 

3  50 

S  60 

§  70 

If  SI 

l  100 


Notes 

1.  The  Ethiopic  script  used  for  Amharic  is  also  used  for  other  languages,  including  Ge’ez, 
Argobba,  Gurage,  and  Tigre.  Ge’ez,  which  is  chiefly  a  liturgical  language,  uses  only  26  basic 
letter  forms  from  this  table. 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character 

Name 

USMARC  hexadecimal  code 

5 

alif 

AE 

< 

ayn 

B0 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

s 

acute 

E2 

Li 

tilde 

E4 

i.J 

macron 

E5 

breve 

E6 

n 

dot  above 

E7 

x 

hachek 

E9 

• 

dot  below 

F2 

underscore 

F6 
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Arabic 


Letters  of  the  Alphabet 


Initial 

Medial 

Final 

Alone 

Romanization 

1 

L 

L 

1 

omit  (see  Note 

mJ 

+ 

b 

J 

Ji. 

c*. 

CJ 

t 

J 

c- 

G 

th 

£■ 

c 

j 

e- 

c 

h 

c- 

t 

kh 

x 

d 

i» 

i 

dh 

y- 

-/ 

r 

J 

Jr 

J" 

J 

z 

mjj4m 

v>**- 

s 

JJj 

u- 

sh 

•4*3 

-+*>- 

U*3- 

u*3 

s 

• 

•4*3 

•4*3^ 

O*3- 

<J° 

d 

JL 

L. 

Jo. 

JU 

t 

J- 

1*. 

Jo. 

Jo 

z 

X 

* 

e- 

6 

‘  (ayn) 

X 

A 

e- 

e 

gh 

J 

-k 

t_0. 

ua 

f  (see  Note  2) 

ji 

JL 

O- 

q  (see  Note  2) 

d. 

d 

k 

J 

1 

J- 

J 

1 

JQ 

A. 

P* 

p 

m 

J 

O- 

o 

n 

.O 

■d 

,  4. 

O  4  O 

h  (see  Note  3) 

J 

> 

3- 

w 

-J 

Tf- 

er- 

»3 

y 

Vowels  and  Diphthongs 


□ 

a 

in 

a 

(see 

Rule  5) 

^9 

-r 

l 

* 

□ 

u 

v$D 

* 

a 

(see 

Rule  6(a)) 

o  *** 

aw 

□ 

i 

u 

O  ^ 

cSD 

ay 
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Letters  Representing  Non-Arabic  Consonants 

This  list  is  not  exhaustive.  It  should  be  noted  that  a  lettter  in  this  group  may  have  more  than  one 
phonetic  value,  depending  on  the  country  or  area  where  it  is  used,  and  that  the  romanization  will  vary 
accordingly. 

£  ch  <3  v 

g  zh  3  v 

fj  zh  «-£  v 

Notes 

1.  For  the  use  of  alif  to  support  hamzah,  see  rule  2.  For  the  romanization  of  hamzah  by  the 
consonantal  sign  ’  (alif)  see  rule  8(a).  For  other  orthographic  uses  of  alif  see  rules  3-5. 

2.  The  Maghiib  ^variations  and  o  are  romanized  /  and  q  respectively. 

3.  6  in  a  word  in  the  construct  state  is  romanized  t.  See  rule  7(b). 

RULES  OF  APPLICATION 

Arabic  Letters  Romanized  in  Different  Ways  Depending  on  Their  Context 

1.  As  indicated  in  the  table,  3  and  ^5  may  represent: 

(a)  The  consonants  romanized  w  and  y,  respectively 

wad‘ 

‘iwad 
dalw 
yad 
hiyal 
tahy 

(b)  The  long  vowels  romanized  u,  1,  and  a  respectively 

ula 
surah 
dhu 
Tman 

jH 

fT 

kitab 
sahab 
juman 

See  also  rules  11(a)  and  ll(bl-2). 


u^3* 

3^ 


ST* 


3 

3 


g 

g 

P 
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(c)  The  diphthongs  romanized  aw  and  ay,  respectively 


awj 

nawm 

PJH 

law 

aysar 

shaykh 

‘aynay 

See  also  rules  11(a)(2)  and  11(b)(3). 

2.  I  ( alif ),  3  and  v_S  when  used  to  support  <=  ( hamzah )  are  not  represented  in  romanization.  See  rule 
8(a). 

3.  I  ( alif)  when  used  to  support  □  ( waslah )  and  □  (maddah)  is  not  represented  in  romanization. 
See  rules  9  and  10. 


4. 


5. 


6. 


I  ( alif)  and  3  when  used  as  orthographic  signs  without  phonetic  significance  are  not  represented 
in  romanization. 

fa‘alu 

ula’ika 

‘ilman  wa-‘amalan  Lolc 

See  also  rule  12  and  examples  cited  in  rules  23-26. 

I  ( alif)  is  used  to  represent  the  long  vowel  romanized  a,  as  indicated  in  the  table. 
fa‘il  J^La 

rid  a 

This  I,  when  medial,  is  sometimes  omitted  in  Arabic;  it  is  always  indicated  in  romanization.  See 
rule  19. 

Final  k3  appears  in  the  following  special  cases: 

(a)  As  k3D-  (alif  maqsurah)  used  in  place  of  to  represent  the  long  vowel  romanized  d. 


hatta 

-  Vs? 

mada 

kubra 

vS^ 

Yahya 

musamma 

Mustafa 
•  • 
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(b)  As  ^3  9  in  nouns  and  adjectives  of  the  form  fa  ‘U  which  are  derived  from  defective  roots. 
This  ending  is  romanized  T,  not  ty,  without  regard  to  the  presence  of  B  ( shaddah ).  See 
rule  ll(b2). 

Radfal-Dm 


7. 


Compare  the  fa  ‘Tl  form  of  the  same  root  [with  out  shaddah ]  al-RadT. 

(c)  As  ,3  □  in  the  relative  adjective  ( nisbah ).  The  ending,  like  (b)  above,  is  romanized  i,  not 
Ty. 

al-Misrf 

Compare  al-Misriyah  and  see  rule  ll(bl). 

®  (ta*  marbutah ) 


(a)  When  the  noun  or  adjective  ending  in  &  is  indefinite,  or  is  preceded  by  the  definite  article, 
6  is  romanized  h.  The  &  in  such  positions  is  often  replaced  by 

salah 

al-Risalah  al-bahiyah  A'O 

mir‘ah  2>7 

Urjuzah  ffal-tibb  «-i=Jl  ^  6 

(b)  When  the  word  ending  in  &  is  in  the  construct  state,  &  is  romanized  t. 

Wizarat  al-Tarbiyah 
Mir’at  al-zaman 

(c)  When  the  word  ending  in  6  is  used  adverbally,  &  (vocalized  &)  is  romanized  tan.  See  rule 
12(b). 


Romanization  of  Arabic  Orthographic  Symbols  Other  than  Letters  and  Vowel  Signs 

The  signs  listed  below  are  frequently  omitted  from  unvocalized  Arabic  writing  and  printing;  their 
presence  or  absence  must  then  be  inferred.  They  are  represented  in  romanization  according  to  the 
following  rules: 

8.  *  ( hamzah ) 

(a)  In  initial  position,  whether  at  the  beginning  of  a  word,  following  a  prefixed  preposition  or 
conjunction,  or  following  the  definite  article,  «=  is  not  represented  in  romanization.  When 
medial  or  final,  *  is  romanized  as  ’  (alif). 


ALA-LC  Romanization  Tables 


page  13 


Arabic 


asad 

uns 

idha 

ISI 

p 

mas’alah 

AJI-ulUO 

mu’tamar 

da’im 

pj,b 

mala’a 

>> 

khati’a 

• 

(b)  *  ,  when  replaced  by  the  sign  □  (waslah)  and  then  known  as  hamzat  al-wasl ,  it  is  not 

represented  in  romanization.  See  rule  9  below. 

9.  □  (waslah),  like  initial  9  ,  is  not  represented  in  romanization.  See  also  rule  8(b)  above.  When 
the  alif  which  supports  waslah  belongs  to  the  article  Jl  ,  the  initial  vowel  of  the  article  is 
romanized  a.  See  rule  17(b).  In  other  words,  beginning  with  hamzat  al-wasl,  the  initial  vowel 
is  romanized  i. 

Rihlat  Ibn  Jubayr 
al-istidrak 
kutub  iqtanat'ha 
bi-ihtimam  ‘Abd  al-Majfd 

10.  □  (i maddah ) 

(a)  Initial  I  is  romanized  a 

alah 

Kulliyat  al-Adab 

(b)  Medial  I,  when  it  represents  the  phonetic  combination  'a,  is  so  romanized. 

ta’alff 

ma’athir  Lo 

(c)  □  is  otherwise  not  represented  in  romanization. 

khulafa’  * 

11.  G  (shaddah  or  tashdud) 

(a)  Over  j  : 

(1)  3U ,  representing  the  combination  of  long  vowel  plus  consonant,  is  romanized  uw. 
‘aduw  j 

quwah 

See  also  rule  1(b). 


^1 


vill 


\  i<» a  rr  Nit  p La g)T 


iiJT 

A& I 
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12. 


(2)  jD ,  representing  the  combination  of  diphthong  plus  consonant,  is  romanized  aww. 
Shawwal 
sawwara 
jaww 

See  also  rule  1(c). 

(b)  Over  : 

(1)  Medial  v5Q>  representing  the  combination  of  long  vowel  plus  consonant,  is 
romanized  Ty. 

al-Misnyah 

«** 

See  also  rule  1(b) 

(2)  Final  vj;Q  is  romanized  T.  See  rules  6(b)  and  6(c). 


Pm 


(3)  Medial  and  final  »jsi5 ,  representing  the  combination  of  diphthong  plus  consonant,  is 
romanized  ayy. 
ayyam 
sayyid 

Qusayy  Zt-*^ 

See  also  rule  1(c) 

(c)  Over  other  letters,  fi  is  represented  in  romanization  by  doubling  the  letter  or  digraph 
concerned. 

al-Ghazzf 

al-Kashshaf 

Tanwm  may  take  the  written  form  ff,  □  (In),  or  9,  romanized  un,  an,  and  in,  respectively. 
Tanwm  is  normally  disregarded  in  romanization,  however.  It  is  indicated  in  the  following  cases: 

(a)  When  it  occurs  in  indefinite  nouns  derived  from  defective  roots. 

qadin 


ma  nan 


(b)  When  it  indicates  the  adverbial  use  of  a  noun  or  adjective. 


tab‘an 

\^L 

faj  ’  atan 

bis*. i 

al-Mushtarik  wad‘an  wa-al-muftariq 

suq‘an 

Itt  <i  ^ 
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Grammatical  Structure  as  It  Affects  Romanization 


13. 


Final  inflections  of  verbs  are  retained  in  romanization 
man  waliya  Misr 
ma‘rifat  ma  yajibu  la-hum 
salla  Allah  ‘alayhi  wa-sallam 
al-Lu’lu’  al-maknun  ff  hukm  al-akhbar 
‘amma  sayakun 


,  except  in  pause. 

Lo 

p-LoJJ  <\_Jx  All  I  V'7_L*£3 
Cij)  i^/>l I  J|J J>JJ I 

(jj jj.u  Lo£ 


14.  Final  inflections  of  nouns  and  adjectives: 


(a)  Vocalic  endings  are  not  represented  in  romanization,  except  preceding  pronominal  suffixes, 
and  except  when  the  text  being  romanized  is  in  verse. 

Ma‘had  Mawlaya  al-Hasan  ^0^^° 

usuluha  al-nafsiyah  wa-turuq 
tadrisiha 

ila  yawmina  hadha  ^ 

(b)  Tanwin  is  not  represented  in  romanization,  except  as  specified  in  rule  12. 

(c)  o  (ta’  marbutah )  is  romanized  h  or  t  as  specified  in  rule  7. 

(d)  For  the  romanization  of  the  relative  adjective  ( nisbah )  see  rule  6(c). 


15.  Pronouns,  pronominal  suffixes,  and  demonstratives: 


(a)  Vocalic  endings  are  retained  in  romanization. 
ana  wa-anta 
hadhihi  al-hal 

mu  ’  allafatuhu  wa-shuruhuha 


;U  LI 


'J 

JbJl  oia> 
1-9-1 


(b)  At  the  close  of  a  phrase  or  sentence,  the  ending  is  romanized  in  its  pausal  form, 
hayatuhu  wa-‘asruh  ^L^ 

Tawfiq  al- Hakim,  <  p^-JI 

afkaruh,  atharuh  ovbl 


16.  Prepositions  and  conjunctions: 

(a)  Final  vowels  of  separable  prepositions  and  conjunctions  are  retained  in  romanization. 
anna  O I 

annahu  *^1 

bayna  yadayhu  <xrt±J. 

Note  the  special  cases:  mimma,  mimman. 
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(b)  Inseparable  prepositions,  conjunctions,  and  other  prefixes  are  connected  with  what  follows 
by  a  hyphen. 

bi-hi  ^ 

wa-ma‘ahu 

la-sillu 


17.  The  definite  article: 


(a)  The  romanized  form  al  is  connected  with  the  following  word  by  a  hyphen. 


al-kitab  al-thanf 

VL^JI 

al-ittihad 

al-asl 

al-athar 

(b) 


When  Jl  is  initial  in  the  word,  and  when  it  follows  an  inseparable  preposition  or 
conjunction,  it  is  always  romanized  al  regardless  of  whether  the  preceding  word,  as 
romanized,  ends  in  a  vowel  or  a  consonant. 


ila  al-an  o  'Jl 

Abu  al- Wafa’ 

Maktabat  al-Nahdah  al-Misriyah  <u^o 

bi-al-tamam  wa-al-kamal  plojJL 


Note  the  exceptional  treatment  of  the  preposition  J  followed  by  the  article: 
lil-Shirbmf 


See  also  rule  23. 

(c)  The  J  of  the  article  is  always  romanized  /,  whether  it  is  followed  by  a  "sun  letter"  or  not, 
i.e.,  regardless  of  whether  or  not  it  is  assimilated  in  pronunciation  to  the  initial  consonant 
of  the  word  to  which  it  is  attached, 
al-huruf  al-abjadlyah 

Abu  al-Layth  al-Samarqandf  ^-JJI 


Orthography  of  Arabic  in  Romanization 

18.  Capitalization: 

(a)  Rules  for  the  capitalization  of  English  are  followed,  except  that  the  definite  article  al  is 
given  in  lower  case  in  all  positions. 

(b)  Diacritics  are  used  with  both  upper  and  lower  case  letters. 

al-Ijf 

al-Alusf  ^ 
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19. 


The  macron  or  the  acute  accent,  as  appropriate,  is  used  to  indicate  all  long  vowels,  including 
those  which  in  Arabic  script  are  written  defectively.  The  macron  or  the  acute  accent,  as  the  case 
may  be,  is  retained  over  final  long  vowels  which  are  shortened  in  pronunciation  before  hamzat 


al-wasl. 

Ibrahim 
Da’  ud 

Abu  al-Hasan 

ru’us 

dhalika 


i  ^-j\ 

eiJi 


‘ala  al-‘ayn 


20.  The  hyphen  is  used: 


(a)  To  connect  the  definite  article  al  with  the  word  to  which  it  is  attached.  See  rule  17(a). 

(b)  Between  an  inseparable  prefix  and  what  follows.  See  rules  16(b)  and  17(b)  above. 

(c)  Between  bin  and  the  following  element  in  personal  names  when  they  are  written  in  Arabic 
as  a  single  word.  See  rule  25. 


21.  The  prime  (')  is  used: 


(a)  To  separate  two  letters  representing  two  distinct  consonantal  sounds,  when  the 
combination  might  otherwise  be  read  as  a  digraph. 

Ad '  ham 

akramat 1  ha  I 

(b)  To  mark  the  use  of  a  letter  in  its  final  form  when  it  occurs  in  the  middle  of  a  word. 

Qahah'jf 

Shaykh'zadah 


22.  As  in  the  case  of  romanization  from  other  languages,  foreign  words  which  occur  in  an  Arabic 
context  and  are  written  in  Arabic  letters  are  romanized  according  to  the  rules  for  romanizing 
Arabic. 

Jarmanus  {not  Germanos  nor  Germanus)  ^^Jlo 

Lurd  Ghranffl  {not  Lord  Granville)  ^>1 

IsaghujT  {not  Isagoge)  LuJ 


For  short  vowels  not  indicated  in  the  Arabic,  the  Arabic  vowel  nearest  to  the  original 
pronunciation  is  supplied. 

Gharsiya  Khayin  {not  Garcia  Jaen)  O-tr 
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Examples  of  Irregular  Arabic  Orthography 

23.  Note  the  romanization  of  Aiil,  alone  and  in  combination. 


Allah 

Aiil 

billah 

AhL 

lillah 

Ail 

bismillah 

Ajjl 

al-Mustansir  billah 
• 

Note  the  romanization  of  the  following  personal 

names. 

Taha 

Yasm 

‘Amr 

Bahjat 

CmI  and  Ch  are  both  romanized  ibn  in  all  positions. 

Ahmad  ibn  Muhammad  ibn  Abf  al-RabP 
•  • 

g.i.J ^y-J  l  Cj-f 

Sharh  Ibn  ‘Aqil  ‘ala  Alfiyat 

Aj^JI  (_J/l  C 

Ibn  Malik 

V  *1J  Lq 

Exception  is  made  in  the  case  of  modern  names,  typically  North  African,  in  which  the  element 
Ch  is  pronounced  bin. 

Bin  Khiddah  O-t 

Bin-‘Abd  Allah  All  I  lu 

26.  Note  the  anomalous  spelling  asLo  romanized  mi’ah. 

SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

/ 

soft  sign  (prime) 

A7 

) 

alif  (hamzah) 

AE 

C 

ayn 

BO 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

macron 

E5 

□ 

dot  below 

F2 
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Armenian 


Vernacular 

Romanization 

Vernacular 

Romanization 

Upper  case  letters 

Lower  case  letters 

U 

A 

MX 

a 

P 

B  [P] 

(see  Note  1) 

P 

b  [p] 

(see  Note  1) 

3 

G  [K] 

(see  Note  1) 

g  M 

(see  Note  1) 

T 

D  [T] 

(see  Note  1) 

d  [t] 

(see  Note  1) 

b 

0 

(see  Note  2) 

b 

{y 

(see  Note  2) 

a 

z 

q. 

z 

E 

t 

e 

c 

£ 

P 

V 

e 

p- 

T* 

P 

t‘ 

& 

Zh 

(see  Note  3) 

d 

zh 

(see  Note  3) 

b 

I 

1* 

i 

l 

L 

l 

1 

Kh 

(see  Note  3) 

N 

kh 

(see  Note  3) 

it 

i. 

Ts  [Dz]  (see  Notes  1,  3) 

t 

ts  [dz] 

(see  Notes  1,  3) 

H 

K  [G] 

(see  Note  1) 

k 

k  [g] 

(see  Note  1) 

* 

H 

< 

h 

9 

Dz  [Ts]  (see  Notes  1,  3) 

6 

dz  [ts] 

(see  Notes  1,  3) 

a 

Gh 

(see  Note  3) 

*1 

gh 

(see  Note  3) 

& 

Ch  [J] 

(see  Note  1) 

6 

ch  [j] 

(see  Note  1) 

U* 

M 

if 

m 

3 

<fY 

i 

fy 

\h 

(see  Note  4) 

J 

lh 

(see  Note  4) 

•b 

N 

b 

n 

0 

Sh 

(see  Note  3) 

2 

sh 

(see  Note  3) 

n 

O 

n 

o 

s 

Ch‘ 

2 

ch‘ 

P  [B] 

(see  Note  1) 

P  [b] 

(see  Note  1) 

a 

J  [Ch] 

(see  Note  1) 

Z 

j  [ch] 

(see  Note  1) 

n* 

R 

a 

r 

u 

S 

u 

s 

'i 

V 

k 

V 

s 

T  [D] 

(see  Note  1) 

in 

t  [d] 

(see  Note  1) 

p 

R 

P 

r 

3 

Ts‘ 

3 

ts  * 

b 

W 

i 

w 

ru 

U 

ru 

u 

4» 

P‘ 

+ 

p‘ 

* 

K‘ 

■f 

k‘ 

bi 

Ew 

(see  Note  5) 

b  i 

ew 

(see  Note  5) 

b*i 

Ev 

(see  Note  6) 

bk 

ev 

(see  Note  6) 

0 

O 

0 

o 

£b 

F 

* 

f 
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Notes 

1.  The  table  is  based  on  the  phonetic  values  of  Classical  and  East  Armenian.  The  variant  phonetic 
values  of  West  Armenian  are  included  in  brackets  but  are  intended  solely  for  use  in  preparing 
references  from  West  Armenian  forms  of  name  when  this  may  be  desirable. 

2.  This  value  is  used  only  when  the  letter  is  in  initial  position  of  a  name  and  followed  by  a  vowel, 
in  Classical  orthography. 

3.  The  soft  sign  (prime)  is  placed  between  the  two  letters  representing  two  different  sounds  when 
the  combination  might  otherwise  be  read  as  a  digraph  (e.g.,  'brfim.hf,  D'znuni) 

4.  This  value  is  used  only  when  the  letter  is  in  initial  position  of  a  word  or  of  a  stem  in  a 
compound,  in  Classical  orthography. 

5.  Romanization  for  letters  in  Classical  orthography,  sometimes  appears  as  la. . 

6.  Romanization  for  letters  in  Reformed  orthography,  sometimes  appears  as  U  . 

SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character 

Name 

USMARC  hexadecimal  code 

/ 

Soft  sign  (prime) 

A7 

€ 

ayn 

BO 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

□ 

macron 

E5 

Q 

hachek 

E9 

□ 

dot  below 

F2 
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Assamese 


Vowels  and  Diphthongs  (see  Note  1) 


a 

«lt  a 

$  i 

5?  T 

^  u 

^  u 


Consonants  (see  Note  2) 

Gutturals  Palatals 


9 

r 

o 

* 

1 

o 

e 

* 

ai 

o 

au 

Cerebrals 

Dentals 

* 

ka 

5 

ca 

tr 

kha 

cha 

ga 

6? 

ja 

gha 

jha 

\S 

ha 

<33 

ha 

T> 

ta 

• 

S5 

ta 

& 

tha 

• 

* 

ta 

da 

* 

tha 

ra 

• 

Vf 

da 

U 

dha 

«t 

dha 

u 

rha 

3 

na 

«r 

na 

Labials 


Semivowels 


Sibilants 


Aspirate 


Pa 

51 

ya 

■r 

sa 

5p 

pha 

$ 

ya 

q 

sha 

* 

ba 

ra 

sa 

bha 

»t 

la 

51 

ma 

a 

wa 

Anusvara 


Bisaiga 


m 


h 


Candrabindu 
(anunasika) 
(see  Note  3) 


Abagraha 
(see  Note  4) 


□  n,  m 


z  » 

(apostrophe) 
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Notes 

1.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed;  the  forms  used  for 
vowels  following  a  consonant  can  be  found  in  grammars;  no  distinction  between  the  two  is 
made  in  transliteration. 

2.  The  vowel  a  is  implicit  after  all  consonants  and  consonant  clusters  and  is  supplied  in 
transliteration,  with  the  following  exceptions: 

(a)  when  another  vowel  is  indicated  by  its  appropriate  sign;  and 

(b)  when  the  absence  of  any  vowel  is  indicated  by  the  subscript  sign  (  v  )  called  hasanta  or 
birama. 

3.  Candrabindu  before  gutteral,  palatal,  cerebral,  and  dental  occlusives  is  transliterated  n. 
Before  labials,  sibilants,  semivowels,  the  aspirate,  vowels,  and  in  final  position  it  is 
transliterated  in. 

4.  When  doubled,  abagraha  is  transliterated  by  two  apostrophes  (  ”  ). 

SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character 

Name 

USMARC  hexadecimal  code 

9 

apostrophe 

27 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

tilde 

E4 

□ 

macron 

E5 

□ 

dot  above 

E7 

□ 

candrabindu 

EF 

□ 

dot  below 

F2 

g 

circle  below 

F4 

□ 

underscore 

F6 

ALA-LC  Romanization  Tables 


page  23 


Azerbaijani 


Azerbaijani 


Letters  of  the  Alphabet 


Initial 

Medial 

Final 

Alone 

Romanization 

1 

L 

L 

1 

(see  Vowels  and 

u 

T*- 

x— < 

b 

mJ 

u 

P 

J 

CL* 

t 

lj 

a j 

s 

3? 

JX. 

c 

c 

2*. 

r 

e 

9 

_?«. 

c- 

c 

h 

J*. 

& 

c 

X 

X 

X 

d 

X 

X 

i 

z 

y 

V 

-/■ 

X 

r 

J 

J- 

Jr 

0 

z 

> 

> 

V 

j 

-<uj 

-AX*. 

s 

-XU 

-XjU- 

0“ 

§ 

u^~ 

o3 

S 

-4^3 

-4^- 

<u**- 

lU*3 

z 

L 

!>. 

-L. 

1= 

t 

L 

Jx 

lx 

Ji 

z 

X 

A 

e- 

d 

‘  (ayn) 

X 

A 

e- 

d 

g 

J 

.i. 

k_A 

<o 

f 

.a 

iL 

v> 

q 

vxL 

k 

3 

g 

J 

1 

J- 

J 

1 

>0 

A. 

P 

m 

O- 

0 

n 

J 

> 

> 

3 

V 

<v. 

o 

h 

-J 

yj- 

yS 

y 
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Vowels  and  Digraphs  Value  Examples 


a 

6 

barakat 

i 

□ 

00* 

sahil 

i 

□ 

00* 

badimcan 

u 

s 

□ 

qudrat 

u 

s 

3D 

buga 

a 

1  ,  T 

aram 

e 

In 

** 

enll 

I 

*39 

dalll 

i 

*3 ‘9 

qaranliq 

T 

i§ 

l 

SiS 

i§iq 

1 

.  31 

0 

3 

on 

U3  1 

u 

j'n 

un 

u3 1 

•• 

on 

ojl 

0 

3  MJ 

ii 

iiziim 

f>3J3^ 

ey 

Isn 

eyvan 

aw 

3'u 

Awhadi 

<3^3l 

a 

In 

avval 

J W 

i 

In 

00* 

istiqlal 

Notes  to  the  Tables 

1.  As  seen  in  the  examples  above,  vowel  harmony,  which  is  found  in  Modern  Turkish,  applies 
to  Azerbaijani  as  well. 

2.  The  letter  in  final  position  may  represent  the  long  vowel  romanized  a,  in  addition  to  the 
vowels  romanized  I  and  1.  This  occurs  in  Arabic  names,  such  as: 

Mustafa 

al-Musanna 

3.  Vowel  points  are  used  sparingly  in  Azerbaijani  publications.  For  romanization,  they  must  be 
supplied  from  a  dictionary. 


RULES  OF  APPLICATION 

Letters  Which  May  Be  Romanized  in  Different  Ways  Depending  on  Their  Context 

1.  *  ( hamzah ) 

(a)  When  initial,  *  is  not  represented  in  romanization. 

iizdah  °^33 1 
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Azerbaijani 

(b)  When  medial  or  final  in  words  of  Perso-Arabic  origin,  *  is  romanized  as  ’  (alif),  except  when 
it  accompanies  the  phonetic  sound  e  (as  in  men),  in  which  case  it  is  romanized  by  e. 
mas’alah 
but  gecah 
ne?ah 

2.  □  ( maddah ) 

(a)  Initial  T  is  romanized  a. 

ad  am 

(b)  Medial  7,  when  it  represents  the  phonetic  combination  'a,  is  so  romanized. 

Heydar’abad 


A  1  *  aiia 


«A>  _ 

3.  □  (shaddah  or  tashdid)  is  represented  by  doubling  the  letter  or  digraph  concerned. 

saqqal  JISLu 

u> 

Note  the  exceptional  case  where  □  is  written  over  j  and  »3  to  represent  the  combination  of  long 
vowel  plus  consonants. 

madanFyat 

4.  Tanvln  (written  □ ,  % ,  u ,  *□),  which  occurs  chiefly  in  Arabic  words,  is  romanized  un,  in,  an,  and 
an,  respectively. 


5. 


The  consonant  letter  &  at  the  end  of  Arabic  words  in  the  genetive  construction  (i%afah)  is  romanized 
by  t. 

takmilat  al-axbar 


Grammatical  Structure  as  It  Affects  Romanization 

6.  I^afa/i.  When  two  Persian  words  are  used  in  an  Azerbaijani  context  in  a  relationship  known  as 
izafah,  the  first  word  (the  mu^af)  is  followed  by  an  additional  letter  or  syllable  in  romanization. 
This  is  added  according  to  the  following  rules: 

(a)  When  the  mu^af  bears  no  special  mark  of  izafah,  it  is  followed  by  -i. 

Sazman-i  tabligati-Islaml  cdo  J  /j 

(b)  When  the  muzaf  is  marked  by  the  addition  of  <=,  it  is  followed  by  -7. 

Nabigah-’i  dahr 

(c)  When  the  mu^af  is  marked  by  the  addition  of  »3,  it  is  followed  by  -yi. 

darya-yi  nur 
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(d)  Izafah  is  represented  in  romanization  of  personal  names  only  when  implied  in  the  Persian 
script. 


Muclr-i  BeylaqanT 
MaktabT-i  §  TrazT 


Affixes  and  Compounds 

7.  Affixes. 

(a)  When  the  affix  and  the  word  with  with  it  is  connected  grammatically  are  written  separately  in 
Azerbaijani,  the  two  are  separated  in  romanization  by  a  single  prime  ('). 


gecah/lar 

‘alim'lar 

Nacaf'zadah 

Vahabzadah'nm 

Yazigi'fdF 

O  ad  ami  ar 7  fdflar 

hayatdah'kf 


J  ^ 


^  j\ 

\  Li> 


(b)  The  Arabic  article  al  is  separated  by  a  hyphen,  in  romanization,  from  the  word  to  which  it  is 
prefixed. 

maqbarat  al-§u‘ara  I^JJI 

‘Abd  al -Karlin  V* 


Compounds.  When  the  elements  of  a  compound  (except  a  compound  personal  name)  are  written 
separately  in  Azerbaijani,  they  are  separated  in  romanization  by  a  single  prime  ('). 


Islam'  §inas 
but  Naqd  ‘AlF 
‘Avaz  ‘Alf 


p)Lul 
la  4<>y£. 


Orthography  of  Azerbaijani  in  Romanization 

9.  Capitalization. 

(a)  Rules  for  the  capitalization  of  English  are  followed,  except  that  the  Arabic  article  al  is  lower 
cased  in  all  positions. 

(b)  Diacritics  are  used  with  both  upper  and  lower  case  letters  in  romanization. 

10.  Foreign  words.  Foreign  words  in  an  Azerbaijani  context,  including  Persian  and  Arabic  words,  are 
romanized  according  to  the  rules  for  Azerbaijani.  For  short  vowels  not  indicated  in  the  script,  the 
Azerbaijani  vowels  nearest  the  original  pronunciation  of  the  word  are  supplied  in  romanization. 

saffnat  al-nacat 

§ahan§ahl 

kapftalasyon 
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SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

/ 

soft  sign  (prime) 

A7 

9 

alif 

AE 

C 

ayn 

BO 

1 

Turkish  i  (lower  case) 

B8 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

tilda 

E4 

□ 

macron 

E5 

□ 

breve 

E6 

□ 

dot  above 

E7 

□ 

umlaut  (diearesis) 

E8 

g 

cedilla 

FO 

□ 

dot  below 

F2 

0 

double  dot  below 

F3 

□ 

underscore 

F6 
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Balinese 

1.  Principal  consonants1 

urn 

in 

(h)a2 

U 

pa 

IQ  _ 

<7 

na 

Ml 

00 

9Jl 

ca 

Ml 

Ml 

dha 

71  u 

ra 

t=? 

(~r\ 

> 

o 

<Q 

ka 

d 

ya 

IQ 

C-\ 

da 

no 

no 

ffa 

\SY| 

\n 

ta 

€il 

e 

raa 

m  ji 

i 

sa 

rn 

n 

ga 

u 

wa 

to 

ba 

CJ 

n 

TU  . 

<*\J 

la 

VO 

Oju 

n 

n 

nga 

2.  Other  consonant  forms3 

na  (na) 

rn 

*a 

ru 

ca  (cha) 

un 

ha 

V01 

Cl 

ta  (tha) 

rcn 

kha 

wi  .ji 

sa  (£a) 

• 

•  • 

u 

fa 

TOl 

XV 

sa  (sa) 

1R 

za 

v3  uO 

pa  (pha) 

rn 

gha 

TWl 

TOO 

ga  (gha) 

LQ 

ba  (bha) 

page  50 

^L4-LC  Romanization  Tables 

3.  Vowels  and  other  agglutinating  signs4 


Balinese 


4.  Numerals 


ya,  ia 


6 


r 


8 


u 


ra 


r£ 


ro 

18 

13 

h 

ng 


ng 


rlf  l 
6 


4 

9 


«1 


5 


o  0 


Notes 

1.  Each  consonant  has  two  forms,  the  regular  and  the  appended,  shown  on  the  left  and  right 
respectively  in  the  romanization  table.  The  vowel  a  is  implicit  after  all  consonants  and 
consonant  clusters  and  should  be  supplied  in  transliteration,  unless: 

(a)  another  vowel  is  indicated  by  the  appropriate  sign;  or 

(b)  the  absence  of  any  vowel  is  indicated  by  the  use  of  an  adeg-adeg  sign  ( n. ).  (Also 

known  as  the  tengenen  sign;  paten  in  Javanese.)  { 
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2.  This  character  often  serves  as  a  neutral  seat  for  a  vowel,  in  which  case  the  h  is  not 
transcribed.  Generally  speaking,  ha  in  word-initial  or  vowel-medial  position  in  a  root  word, 
is  transcribed  without  the  h.  Root  word-final  ha  followed  by  suffixal  vowels,  on  the  other 
hand,  is  always  written  with  h.  When  questions  arise  as  to  whether  the  h  should  be 
represented,  consult  a  standard  dictionary  of  Balinese  in  Latin  script. 

3.  "Other  consonant  forms"  refers  to  aksara  wayah  ("on  the  left")  and  aksara  rekan  ("on  the 
right").  The  aksara  rekan ,  which  are  most  frequently  encountered  in  texts  originating  from 
north  Bali  and  Lombok,  are  used  to  indicate  phonemes  alien  to  Balinese,  particularly  words 
of  Arabic  or  Dutch  origin.  When  used  in  purely  Balinese  words,  the  aksara  wayah  are 
similar  to  capital  letters  and  have  an  honorific  effect  which  is  not  preserved  in  standard 
romanization.  In  words  of  Sanskrit  origin,  or  in  writing  Sanskrit,  Old  Javanese,  or  Old 
Balinese  text,  the  same  characters  represent  aspiriated  or  other  consonants  and  should  be 
romanized  with  the  alternative  equivalents  provided  in  parentheses. 

4.  Vowels  are  almost  always  indicated  by  one  of  a  class  of  agglutinating  signs 

( pangang ge-suara )  added  above,  below,  before,  or  after  the  consonant  or  consonant  cluster 

which  they  affect.  Other  signs  are  used  to  indicate  the  various  forms  of  the  semi-vowels  r, 
/,  and  y ,  as  well  as  the  consonants  h  and  ng,  when  they  occur  in  certain  positions  within  a 
syllable.  Free  standing  vowels  (shown  to  the  right  in  romanization  table)  are  most  commonly 
found  in  initial  syllables  only.  No  difference  between  vowels  indicated  by  free  standing 
characters  and  those  represented  by  agglutinating  signs  is  preserved  in  romanization. 

5.  Sometimes  this  character,  the  a-kara,  is  used  as  a  neutral  seat  that,  when  marked  with  the 

appropriate  sign,  can  also  be  transliterated  as  i,  u,  etc.  Thus  >A  is  romanized  i,  and  so 

forth. 


6.  This  character,  the  nania,  is  used  in  consonant  clusters  within  words.  Consonant  clusters 
between  words  formed  when  the  second  word  begins  with  y  use  the  ordinary  appended  form 
of  ya.  In  romanization  these  two  forms  are  not  distinguished.  In  some  words  (for  example: 
siap),  the  nania  is  transcribed  as  ia  instead  of  ya.  Familiarity  with  the  orthographical 
conventions  is  necessary  to  assure  proper  romanization  of  such  words. 

7.  This  mark,  the  ledung,  forms  optional  ligatures  with  all  letters  except  ba,  nga,ja,  and  nya, 

slightly  changing  their  morphology.  For  example,  ^  tCi  O  may  also  be  written  . 

8.  This  character,  the  surung-i,  takes  the  special  form  c  following  the  suku-i.  Thus  Qc 

is  romanized  rhi.  ^ 


9.  This  character,  the  suku-u,  takes  the  form  V with  certain  consonants  and  semi-vowels. 
Thus  - ... , 


. ,  is  romanized  hru  and  7Q  s  is  romanized  nyu. 
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SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character 

Name 

USMARC  hexadecimal  code 

< 

ayn 

BO 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

0 

tilde 

E4 

□ 

macron 

E5 

□ 

breve 

E6 

□ 

dieresis  (umlaut) 

E8 

□ 

dot  below 

F2 
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Belorussian 


Vernacular 

Romanization 

Vernacular 

Romanization 

Upper  case  letters 

Lower  case  letters 

A 

A 

a 

a 

B 

B 

6 

b 

B 

V 

B 

V 

r 

H 

r 

h 

r 

G 

r 

g 

a 

D 

a 

d 

E 

E 

e 

e 

E 

15 

e 

o 

io 

x 

3 

Z 

3 

z 

I 

I 

i 

i 

M 

T 

M 

V 

l 

K 

K 

K 

k 

n 

L 

n 

1 

M 

M 

M 

m 

H 

N 

H 

n 

o 

O 

O 

o 

n 

P 

n 

P 

p 

R 

P 

r 

c 

S 

c 

s 

T 

T 

T 

t 

y 

U 

y 

u 

y 

0 

y 

u 

<f> 

F 

f 

X 

Kh 

X 

kh 

n 

Ts 

U 

ts 

H 

Ch 

M 

ch 

in 

Sh 

111 

sh 

m 

Shch 

m 

shch 

bi 

Y 

bi 

y 

b 

1  (soft  sign) 

b 

1  (soft  sign) 

3 

E 

3 

e 

K) 

ItJ 

KD 

o 

1U 

n 

1A 

a 

<“>• 

xa 

page  34  ALA-LC  Romanization  Tables 


Belorussian 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character 

Name 

USMARC  hexadecimal  code 

/ 

soft  sign  (prime) 

A7 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

□ 

breve 

E6 

□ 

dot  above 

E7 

e 

ligature,  1st  half 

EB 

0 

ligature,  2nd  half 

EC 
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Bengali 


Vowels  and  Diphthongs  (see  Note  1) 


«rt 


a 

a 

i 

T* 

1 

U 

u 

r 


*  o 

*  1 

o 

if)  e 

J?  ai 

'Q  O 

£  au 


Consonants  (see  Note  2) 

Gutturals  Palatals  Cerebrals  Dentals 


ka 

5 

ca 

ta 

ta 

«r 

kha 

cha 

tha 

• 

<s. 

ta 

ga 

ja 

© 

da 

tha 

gha 

jha 

^5 

ra 

da 

© 

ha 

<fP 

ha 

dha 

• 

*t 

dha 

15 

rha 

• 

5R 

na 

na 

• 

Labials 


Semivowels 


Sibilants 


Aspirate 


*r 

pa  ^ 

ya 

sa 

pha  ^ 

ya  ? 

sha 

ba(See  Note  3)  ? 

ra  ?! 

sa 

bha  e'f 

la 

ma  ? 

ba  (see  Note  3) 

Anusvara 


?  m 


Bisarga 


z  h 


Candrabindu 
(anwiasika) 
(see  Note  4) 


□  n,m 


Abagraha 
(see  Note  5) 


5  » 

(apostrophe) 
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Notes 

1.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed;  the  forms  used  for 
vowels  following  a  consonant  can  be  found  in  grammars;  no  distinction  between  the  two  is 
made  in  transliteration. 

2.  The  vowel  a  is  implicit  after  all  consonants  and  consonant  clusters  and  is  supplied  in 
transliteration,  with  the  following  exceptions: 

(a)  when  another  vowel  is  indicated  by  its  appropriate  sign;  and 

(b)  when  the  absence  of  any  vowel  is  indicated  by  the  subscript  symbol  (  '  )  called 
hasanta  or  birama. 

3.  3  is  used  both  as  a  labial  and  as  a  semivowel.  When  it  occurs  as  the  second  consonant 
of  a  consonant  cluster,  it  is  transliterated  va.  When  3  is  doubled,  it  is  transliterated  bba. 

4.  Candrabindu  before  guttural,  palatal,  cerebral,  and  dental  occlusives  is  transliterated  n. 
Before  labials,  sibilants,  semivowels,  the  aspirate,  vowels,  and  in  final  position  it  is 
transliterated  in. 

5.  When  doubled,  abagraha  is  transliterated  by  two  apostrophes  (  ”  ). 

SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character 

Name 

USMARC  hexadecimal  code 

> 

apostrophe 

27 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

/V 

□ 

tilde 

E4 

□ 

macron 

E5 

□ 

breve 

E6 

□ 

dot  above 

E7 

a 

candrabindu 

EF 

□ 

dot  below 

F2 

□ 

underscore 

F6 
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Bulgarian 


Vernacular 

Romanization 

Vernacular 

Romanization 

Upper  case  letters 

Lower  case  letters 

A 

A 

a 

a 

B 

B 

6 

b 

B 

V 

B 

V 

r 

G 

r 

g 

n 

D 

a 

d 

E 

E 

e 

e 

>K 

Zh 

3K 

zh 

3 

Z 

3 

z 

M 

I 

M 

i 

H 

I 

M 

V 

i 

K 

K 

K 

k 

n 

L 

n 

1 

M 

M 

M 

m 

H 

N 

H 

n 

O 

O 

O 

o 

n 

P 

n 

P 

p 

R 

p 

r 

c 

S 

c 

s 

T 

T 

T 

t 

y 

U 

y 

u 

<i> 

F 

$ 

f 

X 

Kh 

X 

kh 

u 

TS 

u 

ts 

M 

Ch 

M 

ch 

HI 

Sh 

UJ 

sh 

m 

Sht 

m 

sht 

U  (see  Note  1) 

T> 

u  (see  Note  1) 

B 

[disregarded] 

b 

[disregarded] 

% 

ffe  (see  Note  2) 

t 

fe  (see  Note  2) 

K> 

fD 

K) 

1U 

fA 

a 

o 

ia 

X 

U  (see  Note  2) 

5* 

u  (see  Note  2) 
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Notes 

1.  "b  is  romanized  as  u  when  it  appears  in  the  middle  of  a  word.  It  is  disregarded  when  it  appears 
at  the  end  of  a  word. 

2.  Letter  is  found  chiefly  in  older  texts. 

CHARACTER  MODIFIERS  IN  ROMANIZATION 


Character  Modifiers 

Name 

USMARC  hexadecimal  code 

□ 

breve 

E6 

0 

ligature,  first  half 

EB 

0 

ligature,  second  half 

EC 

0 

candrabindu 

EF 
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Burmese 


1.  Consonants 


CD 

ka 

§ 

ta 

o 

Pa 

3 

kha 

s 

tha 

o 

pha 

0 

ga 

3 

da 

o 

ba 

CO 

gha 

o 

dha 

CO 

bha 

c 

ha 

CTO 

na 

© 

ma 

© 

ca 

CD 

ta 

CO 

ya 

30 

cha 

CD 

tha 

s 

ra 

C 

ja 

3 

da 

CO 

la 

jha 

© 

dha 

o 

va 

£ 

nna 

? 

na 

CO 

sa 

e 

ha 

oo 

ha 

g 

la 

33 

’a 

Vowels  (see  Note  1) 

cP,  □! 

□  a 

©□ 

□e 

CO 

i 

o 

□ 

□  i 

\ 

□ 

□ai 

ai 

T 

1 

e 

□ 

□  T 

©□o,©n1 

□  o 

e 

U 

□  ,  □[ 

□  u 

GDO^gdT 

□  o‘ 

0 

e 

u 

□  ,  nil 

ii 

□  u 

o  O 

□  ,  D\ 

i  L 

□  ui 

e 

e 

S 

o 

G  g0C 

o‘ 

Medials 

□J 

□y 

§ 

□  r 

□ 

□v 

□ 

□  h 

A 

J 

Medials  are  written 

in  the  order: 

y  or  r  -  v  -  h;  for  example: 

myha  g  krva  lvha  |ocs  rnrvhan4" 


Notes 

1.  Some  symbols  in  the  Burmese  script  are  attached  above,  below,  or  beside  a  consonant. 
Where  such  symbols  are  listed  in  this  table,  the  symbol  is  used  to  represent  the 
consonant. 
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2.  Absence  of  a  vowel  symbol  in  the  script  is  represented  by  a  in  romanization,  with  the 
following  exceptions: 

(a)  when  a  different  vowel  is  indicated  by  its  appropriate  symbol  (see  2.  Vowels); 

(b)  when  the  absence  of  any  vowel  is  indicated  by  the  symbol  □  (see  5.  Final  symbols). 


4.  Conjunct  consonants 

Romanize  an  upper  consonant  before  a  lower  one: 


cocoDDco  takkasuil*  tiracchan*  kambha 

Note  that  the  following  consonants  have  modified  forms  when  conjunct: 


£ 

□ 

fiQ 

£.1 

□  njh 

L 

□  ddh 

CD 

or 

□  nd 

a 

□jjh 

c 

a 

□  tth 

OD 

CO 

□  nth 

CCD 

□  ss 

5.  Final  Symbols 

fi 

fi 

□m 

6.  Tone  Marks 

V 

□s 

□// 

Examples: 

o 

O 

•L 

puix 

o 

OS 

L 

pui/7 

7.  Punctuation 

1 

,  (comma) 

II 

(period) 

8.  Numerals 


The  numerals  are:  o  (0),  o  (1),  J  (2),  ?  (3)>  9  (4),  D  (5),  g  (6),  9  (7),  o  (8),  g  (9). 

9.  Abbreviations 

<^i  e* 

3  n* 

10.  Word  Division 


For  Burmese  words,  leave  a  space  after  each  syllable. 
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For  loanwords,  use  the  same  word  division  as  in  the  original  language.  Apply  the  same  practice  to 
loanwords  with  modified  forms  in  Burmese. 


c©  © 

G  CO  OO  CO 

ko£mItr 

( English ) 

ef  e 

upus£ 

(modified  Pali ) 

o^soo| 

pan£//kan£ 

(Mon) 

GCOO^ 

ko£pran£  1 

(Chinese) 

11.  Capitalization 

Capitalize  words  according  to  the  rules  that  apply  to  English. 

For  personal  names  composed  of  Burmese  elements,  or  of  elements  treated  as  Burmese,  capitalize 
the  initial  letter  of  each  syllable. 

goocgoocoiodc  Mon‘  Mon‘  Cui"  Tan*' 

gooocgooc  Co  Cuin‘  Moh‘ 

ceo  c 

Khyan*  Rl  Cin‘ 


Note  the  following  names  with  non-Burmese  elements: 
\  c  o  c 


co^cocoo^ 


ODCOO 


OGSCDCpCO 


Kainak*  Bha  Cin‘ 
Mra  Sita 
Padesaraja 


12.  Examples 


oc  cpe  c  nc 
«OOOCO«CSGCOO0SgD|GOICS 

c  c©  o  o  c 
GCD033C0CS0^gS0SC^C 

3golyroocs[cgsoOT[ 


Maha  samuin£//  to‘  krT//  nnvan£/  pon£// 
Yo  ‘A  tva n‘//  van‘  U7/  Phui^  Lhuin£ 
Dutthagamani  man£//  krT^  vatthu 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character 

Name 

USMARC  hexadecimal  code 

* 

asterisk 

42 

/ 

soft  sign  (prime) 

A7 

5 

alif 

AE 

< 

ayn 

BO 

ft 

hard  sign  (double  prime) 

B7 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

0 

tilde 

E4 

0 

macron 

E5 

□ 

dot  above 

E7 

□ 

dot  below 

F2 
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Chinese 


ROMANIZATION,  CAPITALIZATION,  AND  PUNCTUATION 

1.  Romanization 

a)  The  Wade-Giles  system  of  romanization,  as  employed  in  Herbert  A.  Giles’^  Chinese-English 
Dictionary  (2nd  edition,  London,  1912),  is  used  with  the  following  modifications  and  rules  of 
application: 

1)  Omit  the  diacritic  marks  breve  (  □  )  and  circumflex  (  □  ); 

2)  The  phonetically  obsolete  final  -io  (syllabic  yo )  and  the  phonetically  redundant  syllabic 
yi  and  e  are  eliminated; 

3)  The  national  or  Kuo-yii  (Peking)  standard  pronunciation  is  followed  in  the  reading  of 
characters  and  in  making  necessary  semantic  distinctions  between  multiple  readings  of 
single  characters,  according  to  the  usage  of  Kuoyu  tz‘u  tien  (Gwoyeu  tsyrdean )  Shanghai, 
1937  (reprinted  in  Taipei,  1959),  the  romanized  orthography  of  the  latter  being 
modified  to  conform  to  Wade-Giles1 

4)  colloquial  reading  are  normally  preferred  to  "literary"  ones,  and  those  designated  by 

Kuo  yii  tz‘u  tien  as  "another  reading"  (  %  %%  )  are  not  used,  except  as  required  by 

usage  in  special  compounds.  A  list  of  modifications  is  appended. 

b)  Titles  of  works  written  in  classical  Chinese  by  Japanese  or  Korean  authors  (i.e.,  in  Kambun 
or  Hanmun)  are  romanized  as  Japanese  or  Korean,  respectively;  titles  of  Japanese  or  Korean 
editions  of  works  written  in  Chinese  by  Chinese  authors  are  romanized  as  Chinese  if  "reading 
marks"  or  textual  matter  in  Japanese  or  Korean  have  not  been  added.  Reference  is  made  from  the 
alternative  romanization  when  such  titles  are  traced  as  added  entries. 


2.  Capitalization 

a)  The  first  word  of  a  corporate  name  is  capitalized.  The  first  word  of  the  name  of  a  corporate 
subdivision  appearing  in  conjunction  with  the  name  of  the  larger  body  is  capitalized  only  when  used 
in  headings. 

b)  Geographical  names  and  the  names  of  dynasties  are  capitalized. 

c)  The  first  word  of  the  title  of  a  book,  periodical,  or  series  is  capitalized. 

d)  Only  the  first  syllable  of  hyphenated  surnames  and  given  names  or  courtesy  names  in  two 

syllables  (two  characters)  is  capitalized. 

3.  Punctuation 

a)  A  centered  point  (  •  )  indicating  co-ordinate  words  is  transcribed  as  a  comma. 

b)  Brackets  (  r  ...  J  )  used  in  the  manner  of  quotation  marks  ("...")  are  transcribed  as  the 

latter. 


XCf.  Concise  Dictionary  of  Spoken  Chinese,  by  Yuen  Ren  Chao  and  Lien  Sheng  Yang  (Harvard  University  Press,  1947)  p.  279-282 
(Concordance  of  Wade-Giles  and  National  Romanization)  and  p.  255-278  (Index  Proper)  and  Chinese  Character  Indexes,  by  Ching-yi 
Dougherty,  Sydney  M.  Lamb,  and  Samuel  E.  Martin  (University  of  California  Press,  1963)  v.  1,  p.  xxii-xxix. 
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4.  Word  Division 


Each  character  is  romanized  as  a  separate  word. 


Exceptions: 

1)  The  romanized  parts  of  multicharacter  surnames  and  given  names  are  hyphenated. 

2)  The  romanized  parts  of  multicharacter  geographic  names  are  hyphenated  when  these 
names  appear  in  titles  or  as  part  of  corporate  names  (except  for  geographical  generics,  which 
appear  as  separate  words). 

Chung-kuo  wen  hsueh  chia  lieh  chuan 

Shang-hai  shih  chih  nan 

Shih  nien  lai  chih  T‘ai-wan  ta  hsueh 


Wo  so  jen  shih  ti  Chiang  Chieh-shih 


Table  of  Modifications  to  Romanizations 


Number 

Giles’  Romanization, 
Character,  and  Number 

Reading  and  page  in 

Kuo  yii  tz%i  tien 

ALA-LC  Romanization 

1 

ch‘a 

& 

193 

a)  Ja 

(SE) 

2595 

a)  Cha  (as  a  surname) 

b)  char 

(X) 

2793 

b)  ch‘a 

2 

chao 

482 

jwo 

(40 

2726 

cho 

3 

chao 

n 

493 

chaur 

m 

2814 

ch‘ao 

4 

cheng 

699 

a)  jen 

(R) 

2649 

chen 

b)  jeng 

(iE)  5U8 

2681 

5 

ch'eng 

m 

770 

a)  chemg 

( ffi ) 

2892 

a)  ch‘eng 

(read  sheng  or  cheng 

b)  sheng 

(4) 

3185 

b)  sheng  (in  the  com- 

as  a  numerative  of 

pounds  Jz  in  the 

vehicles  and  temples 

Buddhist  sense  of 

or  annals) 

vehicle  and  £  ^ 

to  mean  annals) 

6 

chi 

m 

905 

a)  jyi 

(»)  XI# 

1781 

b)  jiuh 

(») 

1997 

chii 

7 

ch‘i 

m 

1007 

a)  chi 

(«)  XM 

2032 

b)  shi 

(^)  & 

2232 

hsi 

8 

ch‘i 

m 

1009 

a)  chi 

(m  xa 

2032 

(also  read  hsi ) 

b)  shi 

(^) 

2232 

hsi 
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Number 

Giles’  Romanization, 

Reading  and  page  in 

ALA-LC  Romanization 

Character,  and  Number 

Kuo  yiitz'u  tien 

9 

ch‘i 

m 

1066 

a)  ji 

(PA 

1761 

chi 

(also  read  chi ) 

b)  chyi 

(») 

2051 

10 

ch‘i 

no 

1098 

jye 

(*) 

1833 

chieh 

(also  read  chieh) 

11 

ch‘i 

ft 

1104 

jih 

(») 

1786 

chi 

12 

ch‘i 

1119 

shih 

(#) 

2259 

hsi 

13 

ch‘iai 

tit 

1203 

a)  kae 

(ffl) 

1474 

k‘ai 

(also  pronounced  k‘ai) 

b)  jie 

(«?) 

XII 

1822 

14 

chiang 

m 

1240 

cheang 

m 

2141 

ch‘iang 

15 

chiang 

m 

1245 

gaang 

(W) 

1313 

kang 

16 

chiao 

m 

1353 

a)  jiaw 

m 

xii 

1866 

b)  shiaw 

m 

2333 

hsiao 

17 

ch‘iao 

% 

1387 

a)  chiau 

(t») 

2077 

a) 

ch‘iao  (in  the  com¬ 
pounds  , 

%  £) 

b)  cheau 

(«) 

2086 

c)  chiueh 

(IX) 

2194 

b) 

ch‘iieh 

18 

chieh 

\% 

1446 

a)  jya 

(flu) 

1812 

b)  shya 

(*S) 

2263 

hsia 

19 

chieh 

% 

1493 

a)  jye 

m 

1836 

a) 

chieh  (in  the  sense 
of  pure,  clean) 

b)  shye 

(M) 

2288 

b) 

hsieh  (in  the  sense 
of  to  regulate, 
to  adjust) 

20 

chieh 

m 

1551 

chih 

(ax) 

2068 

ch‘i 

21 

ch‘ieh 

m 

1576 

kae 

m 

1475 

k‘ai 

22 

chien 

IX 

1633 

chiann 

m 

2123 

ch‘ien 

23 

chien 

n 

1648 

a)  jiuann 

(») 

2014 

a) 

Chuan  (as  a 

(as  a  surname:  Chiian) 

surname) 

b)  jiunn 

m 

2026 

b) 

chiin 

24 

chien 

m 

1649 

jiuan 

m 

2010 

chuan 

ALA-LC  Romanization  Tables 


page  45 


Chinese 


Number 

Giles’  Romanization, 

Reading  and  page  in 

ALA-LC  Romanization 

Character,  and  Number 

Kuo  yii  tzti.  tien 

25 

chih 

*6 

1886 

a)  dii 

(®) 

622 

ti 

b)  jyy 

(Z) 

XI* 

2579 

26 

ch‘ih 

m 

1963 

jyh 

(Z) 

2592 

chih 

27 

ch‘ih 

« 

2011 

shyh 

(W) 

47 

shih 

28 

ch‘in 

n. 

2085 

a)  chiin 

(ft) 

X  SB 

2132 

(read  ch ' 

ien: 

b)  chian 

($) 

2108 

ch‘ien 

to  engrave ) 

29 

ch‘in 

2090 

jinn 

(&) 

1935 

chin 

30 

ching 

W: 

2143 

a)  jiing 

(») 

1968 

a) 

ching 

(read  ying: 

b)  yiing 

(*) 

IS]  ^ 

4164 

b) 

ying  (in  the 

a  shadow) 

compound  f*  Efj  ) 

31 

ching 

m 

2163 

chin 

(&) 

1911 

chin2 

32 

ch‘ing 

JSsC 

2202 

a)  jing 

(w) 

1952 

ching 

b)  chyng 

(e) 

XI# 

2168 

33 

ch‘iung 

m 

2377 

jeong 

(a) 

2027 

chiung 

34 

ch‘iung 

a 

2380 

jeong 

(H) 

2028 

chiung 

35 

chu 

Hi 

2612 

shuh 

(*) 

3215 

shu 

36 

ch‘u 

t* 

2625 

shu 

(«) 

3190 

shu 

37 

ch‘u 

n 

2627 

shu 

(») 

3199 

shu 

38 

ch‘u 

m 

2628 

tswu 

(’€) 

3518 

ts‘u 

39 

ch‘u 

ir 

2669 

a)  shiuh 

(sS) 

2493 

a) 

hsii 

(also  read  hsii,  and 

b)  chuh 

(*») 

2914 

b) 

ch‘u  (in  the  com- 

used  with 

®  4744) 

pounds  £  , 

IS  II  , 

40 

chuang 

2758 

a)  joang 

Off) 

2743 

a) 

chuang 

b)  tzanq 

(«) 

3411 

b) 

tsang  (used  in  the 
name  of  the  Bud- 

41 

chui 

2807 

chwei 

Ofc) 

2924 

dhist  priest  T  ) 
ch‘ui 

2 

The  U.S.  Board  on  Geographic  Names  lists  the  name  5=  .g  as  T'ien-ching,  because  it  follows  Giles’  romanization. 
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Number 

Gi/es'  Romanization, 

Reading  and  page  in 

ALA-LC  Romanization 

Character,  and  Number 

Kuo  yii  tz‘u  tien 

59 

hsiao 

# 

4332 

a)  shyau 

(Ml) 

2308 

b)  yau 

(IS) 

4009 

yao 

60 

hsiao 

ft 

4333 

a)  shyau 

m 

Xfft 

2308 

b)  yau 

(IS) 

4009 

yao 

61 

hsieh 

4371 

shiue 

m 

2497 

hsiieh 

62 

hsieh 

4381 

hair 

m) 

1569 

hai 

63 

hsien 

If 

4474 

shiuan 

(»£) 

2514 

hsuan 

64 

hsien 

IS 

4510 

a)  kaan 

m 

1488 

a) 

k‘an  (in  the  com- 

(also  read  k‘an ) 

pound  H  tSr ) 

b)  jiann 

(M) 

1908 

b) 

chien 

65 

hsien 

4511 

jiann 

CM) 

1909 

chien 

66 

hsin 

¥ 

4565 

a)  shin 

m 

x  m 

2386 

a) 

hsin  (in  the  com- 

(read  Shen,  name 

pound  W  [folk 

of  a  place) 

medicine]  or  as  a 

geographic  name 

¥  iff 

[in  Shanghai] 

b)  shen 

(*) 

3127 

b) 

shen 

67 

hsing 

« 

4616 

a)  shin 

m 

2399 

hsin 

b)  shing 

(«) 

x# 

2454 

68 

hsu 

tiifii 

4762 

yeu 

0£) 

4372 

yii 

69 

hsuan 

*£ 

4830 

a)  shiuan 

m 

X IA 

2515 

b)  shiun 

(M) 

2532 

hsun 

70 

hsiin 

4897 

jiunn 

(ft) 

2023 

chun 

71 

hsun 

4899 

jiunn 

(ft) 

2025 

chun 

72 

huan 

5062 

a)  hoan 

(Sf) 

1709 

b)  woan 

(*) 

4281 

wan 

73 

huang 

5146 

kuanq 

(W) 

1525 

k‘uang 

(interchanged  with 

M 

6412  [k‘uang]) 
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Giles’  Romanization, 

Reading  and  page  in 

ALA-LC  Romanization 

Character,  and  Number 

Kuo  yii  tz‘u  tien 

74 

hui 

5184 

a)  hoei 

1696/99 

a)  hui 

(read  kuei:  to  calcu- 

b)  guey 

m) 

1388 

b)  kuei  (in  the  geo- 

late;  a 

district  of) 

graphic  name  #  f § 

Chekiang) 

c)  kuay 

m 

1509 

c)  k‘uai  (in  the  com- 

pound  #  It 
and  surname) 

75 

hui 

CJtL 

m 

5206 

a)  kuey 

m 

1514 

k‘uei 

b)  huey 

(*) 

xn 

1703 

76 

hui 

m 

5207 

a)  guey 

m 

1387 

b)  kuey 

(*§) 

1514 

k‘uei 

77 

hun 

m 

5240 

kuen 

1520 

k‘un 

78 

hung 

m 

5287 

a)  horng 

(*d 

XII 

1749 

b)  rong 

(«) 

3345 

jung 

79 

i 

m 

5361 

chyi 

(*o 

2046 

ch‘i 

80 

i 

m 

5426 

nii 

(iE) 

909 

ni 

(also  read  ni) 

81 

i 

5434 

ni 

0£) 

908 

ni 

(also 

read  ni) 

82 

i 

m 

5509 

a)  yih 

m 

3*  ~TT* 

pR  E3 

3961 

b)  yeh 

UP) 

3997 

yeh 

83 

jao 

5579 

a)  rau 

(«) 

XU 

3267 

b)  nau 

(B$) 

881 

nao 

84 

je 

is 

5590 

ruoh 

m 

3324 

jo 

85 

jo 

£ 

5644 

a)  ruoh 

m 

3324 

a)  jo 

b)  ree 

(*) 

3263 

b)  je  (in  the  com- 

pounds  <£  %  > 

®  %  ,  etc.) 

86 

jo 

m 

5645 

ree 

m 

3264 

je 

87 

jo 

5649 

reh 

(S) 

3264 

je 

88 

jung 

5742 

yng 

(*) 

4160 

ying 

(also  read  ying) 

89 

jung 

% 

5743 

yng 

m 

4160 

ying 

90 

jung 

% 

5744 

yng 

m 

4160 

ying 

(also  read  ying) 

91 

ko 

6039 

ke 

m 

1453 

k‘o 

(read  k‘o  in  Peking) 
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Number 

Giles’  Romanization , 

. 

Reading  and  page  in 

ALA-LC  Romanization 

Character,  and  Number 

Kuo  yii  tzii  tien 

92 

ko  &£ 

6058 

a)  ger 

(If) 

1254 

a) 

ko 

b)  har 

(»&) 

1540 

b) 

ha  (in  the  com¬ 
pound  H  ) 

93 

k‘uei  1% 

(also  read  hui) 

6481 

huei 

m 

1687 

hui 

94 

kun  IS 

6525 

kuen 

(a) 

1520 

k‘un 

95 

kung  g 

(also  read  chiung ) 

6584 

jiong 

(») 

2027 

chiung 

96 

kung  JH 

6585 

jiong 

(iS) 

2027 

chiung 

97 

kung  m. 

6588 

a)  goong 

(£) 

X  W 

1441 

(read  k^uang  in  Peking) 

b)  kuanq 

(g) 

1527 

k‘uang 

98 

li 

6952 

a)  lii 

m 

x  ia 

1059 

(also  read  lii) 

b)  leu 

(&) 

1236 

lii 

99 

liao  7 

7073 

a)  le 

(W) 

973 

a) 

le  (when  used  as  a 
suffix  in  the  sense 

of  completed  action 
or  change  of  status, 
e.g.,  ) 

b)  leau 

m 

1086 

b) 

liao 

100 

lieh  $$ 

7094 

leh 

m 

970 

le 

101 

lieh  # 

7095 

a)  leh 

m 

970 

a) 

le 

b)  leu 

(a) 

1237 

b) 

lii  (in  the  sense 

of  WttM) 

c)  lhuo 

(») 

1192 

c) 

luo  (in  the  sense 
of  $$»!$) 

102 

lo  fij 

7316 

a)  leh 

(OJ) 

968 

a) 

le 

(also  read  /ei) 

b)  lhei 

(«) 

980 

b) 

lei  (in  the  sense 
of  to  tighten  up 
with  string  or  rope) 

103 

lo  m 

7331 

a)  leh 

m 

970 

a) 

le 

b)  law 

(#) 

1008 

b) 

lao  (in  the  geogra¬ 
phical  name  fj*  3§s) 

(read  yao:  to  find 

c)  yao 

(«) 

4022 

c) 

yao  (in  the  senten- 

pleasure  in) 

ces  \z  #  Ut  LlJ ,  # 

(read  yiieh:  music) 

d)  yueh 

(tt) 

4417 

d) 

yiieh  (to  mean  music, 
or  used  as  surname) 

104 

lung  # 

7507 

a)  now 

(#*) 

887 

(also  read  nung. 

b)  neng 

(g) 

xia 

903 

nou,  and  neng) 

c)  nonq 

(«) 

isua 

945 

nung 

d)  long 

(fiR) 

1230 
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Giles’  Romanization, 

Reading  and  page  in 

AL4-LC  Romanization 

Character,  and  Number 

Kuo  yii  tz‘u  tien 

105 

meng  eg 

7790 

a)  mang 

(It) 

343 

a) 

mang  (in  the  com¬ 
pound  <EtR) 

b)  meng 

(*) 

348 

b) 

meng 

106 

miu  B 

7965 

a)  mou 

ill)  *5  >:■ 

329 

(also  read  niu,  etc.) 

b)  Miaw 

(  h) 

365 

Miao  (as  a  surname) 

c)  miow 

im 

366 

d)  muh 

(ft)  AifSii 

401 

107 

mo 

7998 

meau 

m 

363 

miao 

108 

mo  ol 

8016 

a)  moh 

(Si) 

297 

a) 

mo 

(also  read  mu; 

b)  mei 

(m 

310 

b) 

mei  (as  a  negation 

a  negation  of 
in  which  sense 

in  colloquial  usage) 

also  read  mei ) 

109 

mou  ffi: 

8043 

a)  maw 

(*) 

327 

mao 

b)  mou 

at)  xm 

329 

110 

mou  wX 

8050 

a)  moou 

(it)  x  m 

329 

(also  read  mu) 

b)  muu 

(m) 

390 

mu 

111 

mu 

(also  read  mo) 

8064 

mo 

(®) 

286 

mo 

112 

mu  fl 

8066 

a)  mo 

(*)  xn 

286 

a) 

mo  (in  the  com¬ 
pounds  f  &$) 

b)  mu 

(ft) 

390 

b) 

mu 

113 

mu  % 

8085 

a)  muh 

(ft)  Xtfl 

401 

(also  read  wu) 

b)  wuh 

(*) 

4221 

wu 

114 

nio  ig 

8343 

niueh 

(») 

951 

niieh 

115 

no  nft 

8371 

nah 

(It) 

867 

na 

(read  na:  to  blurt  out) 

116 

pan  15; 

8591 

a)  ban 

(*) 

89 

a) 

pan 

b)  parn 

(ift) 

213 

b) 

p‘an  (in  the  com¬ 
pounds  ,  tt.fi 

c)  bo 

(«0 

22 

c) 

po  (in  the  com¬ 
pound  IS:  2? ) 

117 

pan  B# 

(also  read  p‘an) 

8606 

pann 

(*) 

217 

p‘an 

118 

pao  Ri 

(also  read  p‘u) 

8726 

puh 

(M) 

273 

p‘u 
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119 

120 
121 

122 

123 

124 

125 

126 

127 

128 

129 

130 

131 

132 

133 

134 
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pei 

tt 

8770 

a)  pyi 

(lit) 

234 

a) 

p‘i  (in  IDt ,  to  Hfe  ) 

(used  for 

*8i  9408) 

b)  po 

(M0 

193 

b) 

p‘o  (in  K  Pt ,  or 
when  used  for  to ) 

p‘ei 

3S 

8818 

Pi 

(tit) 

230 

P‘i 

peng 

8875 

a)  bang 

m 

(*) 

105 

a) 

pang 

b)  bin 

150 

b) 

pin  (when  used  as 
short  form  of  ) 

Pi 

W 

8928 

a)  bih 

(no 

115 

a) 

pi  (in  the  geogra¬ 
phic  name  :S  F Sj  ) 

b)  mih 

m 

357 

b) 

mi 

Pi 

to 

8932 

a)  bih 

(It)  Xlfl 

115 

a) 

pi  (in  the  geogra¬ 
phic  name  to 

Peru) 

b)  mih 

m 

357 

b) 

mi 

ping 

m 

9290 

pin 

(  nn  ) 

249 

p‘in 

p‘ing 

n 

9326 

a)  pinn 

(ft) 

253 

a) 

p‘in 

b)  pinq 

264 

b) 

p‘ing  (in  the  com¬ 
pound  ^  m  ) 

po 

9345 

poh 

(«o 

196 

P‘o 

(also  read  p‘o ) 

po 

Sic 

9386 

bar 

(E) 

14 

Pa 

(read  pa 

in  Peking) 

po 

m 

9407 

pwu 

(to) 

269 

p‘u 

p‘o 

ft 

9416 

a)  poh 

(to) 

197 

a) 

p‘o  (in  the  com¬ 
pounds  19  ft-  ,  ftffi) 

b)  pwu 

(Si)  m 

268 

b) 

p‘u 

p‘o 

n 

9425 

bor 

(to) 

35 

po 

P‘o 

m 

9443 

pwu 

(M) 

269 

p‘u 

(also  read  p‘u) 

pu 

m 

9458 

puu 

(to) 

270 

p‘u 

p‘u 

m 

9518 

buh 

m 

172 

pu 

shan 

m 

9704 

chaan 

(at) 

2839 

ch‘an 

ALA-LC  Romanization  Tables 


Chinese 


Number 

Giles’  Romanization, 

Reading  and  page  in 

ALA-LC  Romanization 

Character,  and  Number 

Kuo  yu  tz‘u  tien 

135 

shang 

% 

9734 

a)  charng 

2871 

a) 

ch‘ang  (in  the  sense 
of  brightness) 

b)  shang 

(ffi) 

x  m 

3149 

b) 

shang  (in  the  com¬ 
pound  #  $< ) 

136 

shao 

ft 

9769 

jwo 

(m) 

2722 

cho 

137 

shen 

tf 

9828 

chin 

m 

2126 

ch‘in 

138 

shen 

m 

9844 

chern 

m 

2854 

ch‘en 

139 

shen 

it 

9848 

chern 

(M0 

2851 

ch‘en 

140 

shih 

£ 

9941 

a)  shy 

m 

3040 

a) 

shih  (in  the  com¬ 
pound  ifigt) 

b)  chyr 

(«s) 

2778 

b) 

ch‘ih  (to  mean  spoon 
or  when  used  as  a 
surname) 

141 

shih 

m 

9967 

a)  shyr 

m 

3000 

b)  shuoh 

(«) 

is# 

3227 

shuo 

142 

shu 

m 

10038 

ju 

m 

2701 

chu 

143 

shun 

i? 

10139 

chwen 

m 

2943 

ch‘un 

(also 

read  chiin ) 

144 

shun 

H 

10142 

chwen 

m 

2945 

ch‘un 

145 

shun 

tr- 

10148 

chwen 

m 

2945 

ch‘un 

146 

shun 

$•& 

10149 

chwen 

m 

2941 

ch‘un 

147 

shun 

u 

10150 

chwen 

m 

2943 

ch‘un 

148 

shun 

m 

10162 

joen 

m 

2738 

chun 

149 

sou 

m 

10247 

a)  shuh 

(ft) 

BP 

3213 

shu 

(read  shu  colloquially) 

b)  sow 

(13!) 

ft  # 

3614 

150 

su 

m 

10321 

a)  shu 

(ft) 

3197 

shu 

(read  shu  or  su;  to 
set  forth,  to  explain 

b)  su 

(.?£) 

x  m 

3671 

as  a 

running  commentary) 

151 

su 

ik 

10322 

a)  shu 

(#) 

3198 

shu 

b)  su 

( B) 

x  m 

3671 
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Chinese 


Number 

Giles’  Romanization, 

Reading  and  page  in 

ALA-LC  Romanization 

Character,  and  Number 

Kuo  yii  tz’u  tien 

152 

sui 

n 

10417 

tsuey 

(*) 

3530 

ts‘ui 

153 

sun 

n 

10441 

a)  shiunn 

(Sfi) 

2545 

hsiin 

b)  suenn 

(») 

XI* 

3727 

154 

tao 

n 

10798 

a)  daw 

(73) 

X  I* 

588 

b)  taur 

OR) 

730 

t‘ao 

155 

tieh 

a 

11142 

a)  dye 

(*) 

X  I* 

635 

b)  twu 

(±) 

818 

t‘u 

156 

t‘o 

11400 

a)  tah 

(«l) 

5  tn  ii 

i  707 

a) 

t‘a  (in  the  com¬ 
pounds  Va  #  , 
ffin-) 

b)  tuoh 

(ffi) 

832 

b) 

t‘o 

c)  jyr 

U) 

2564 

c) 

chih  (  [h]  fa  ) 

157 

ts‘ang 

Jt 

11594 

chiang 

(it) 

2134 

ch‘iang 

158 

ts‘ao 

tit 

11649 

a)  tzaw 

(ffl) 

3398 

tsao 

b)  tsao 

m 

X  !« 

3502 

159 

tsu 

11851 

tzuoh 

w 

3431 

tso 

160 

tsun 

m 

11964 

jiunn 

m 

2026 

chun 

161 

t‘un 

& 

12241 

duenn 

(#) 

694 

tun 

162 

t‘ung 

ft 

12300 

torng 

(ii) 

857 

a) 

t‘ung 

b) 

chuang  (in  the  com¬ 
pound  (I  ) 

163 

tzu 

12359 

a)  jiee 

(l£) 

1837 

a) 

chieh 

b)  tzyy 

(*) 

1*  m 

3358 

b) 

tzu  (in  the  com¬ 
pounds  fcfjfcfc 

M  ,  WMctE) 

164 

wai 

% 

12448 

uei 

(«) 

4247 

wei 

165 

wai 

12449 

kuey 

(*) 

1514 

k‘uei 

166 

wei 

St 

12621 

a)  wei 

(«) 

4268 

a) 

wei 

(read  Wei :  Wei-ch‘ih 
as  a  surname) 

(read  yii: 

as  a 

b)  Yuh 

(ii) 

4398 

b) 

Yii-ch‘ih  (as  a 

military  official) 

double  surname) 
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Chinese 


Number 

Giles’  Romanization, 
Character,  and  Number 

Reading  and  page  in 

Kuo  yii  tz%i  tien 

ALA-LC  Romanization 

167 

wei 

^  12632 

huey 

Cfc) 

1702 

hui 

168 

wu 

11  12742 

woh 

m 

4235 

wo 

169 

wu 

ft  12793 

a)  wuh 

(,i)  %  m 

4226 

b)  woh 

(K) 

4234 

wo 

170 

yai 

M  12838 

ya 

m 

3975 

ya 

171 

yao 

m  12960 

a)  niueh 

ot) 

951 

a)  niieh 

(also  read  nio ) 

b)  yaw 

(ffi) 

4023 

b)  yao  (in  the  com- 

pound  ) 

172 

yeh 

$  12979 

a)  jye 

(ft)  7.  Sfl 

1833 

a)  chieh  (in  the  com- 

(also  read  chieh 

pound  )Sfi  % ) 

and  hsieh) 

b)  shye 

m 

2288 

b)  hsieh 

173 

yo 

m  13349 

iue 

(B) 

4402 

yueh 

(also  read  yiieh  and 

[Peking]  yao\  to  weigh ) 

174 

yo 

tBr  13362 

yueh 

(ft) 

4419 

yiieh 

175 

yo 

&  13366 

yueh 

(ft) 

4414 

yiieh 

176 

yo 

*  13367 

yueh 

(ft) 

4420 

yiieh 

177 

yo 

m  13371 

yueh 

(«) 

4420 

yiieh 

178 

yuan 

ffi.  13721 

woan 

(*4f ) 

4279 

wan 

(read 

wan:  pleasant, 

genial,  kind,  used  with 

12479 

[wan]) 

179 

yuan 

m  13724 

woan 

(«) 

4280 

wan 

SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character 

Name 

USMARC  hexadecimal  code 

< 

ayn 

B0 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

a 

umlaut  (dieresis) 

E8 
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Church  Slavic 


Vernacular 

Romanization 

Upper  case  letters 

A 

A 

B 

B 

B 

V 

r 

G 

ZI 

D 

G 

E 

6 

E 

1€ 

ffi 

AC 

Zh 

S 

2 

3 

Z 

M 

I 

ft 

I 

I 

I 

K 

K 

71 

L 

M 

M 

H 

N 

O 

O 

n 

P 

p 

R 

c 

S 

T 

T 

oy 

U 

5 

u 

F 

X 

Kh 

GO 

OT 

CO 

O 

U 

T3 

4 

Ch 

in 

Sh 

m 

Sht 

1> 

"  (hard  sign) 

bl 

Y 

TdI 

Y 

b 

'  (soft  sign) 

"b 

E 

Church  Slavic 

Vernacular  Romanization  Numeric  Value 


Lower  case  letters 


a 

6 

B 

r 

a 

e 

e 

le 

x 

s 

3 

H 

M 

i 

K 

n 

M 

H 

O 

n 

P 

c 

T 

oy 

8 

$ 

x 

(I) 

OJ 

U 

H 

UJ 

41 

T> 

bl 

M 

b 

t> 


a 

b 

v 

g 

d 

e 


1 

2 

3 

4 

5 


e 
< — ' 
le 

zh 

z 

z 

i 

V’ 

l 

T 

1 

k 

1 

m 

n 

o 

P 

r 

s 

t 

u 

u 

f 

kh 

oT 

5 

ts 

ch 

sh 


6 

7 

8 

10 

20 

30 

40 

50 

70 

80 

100 

200 

300 

400 

400 

500 

600 

800 

800 

900 

90 


sht 

"  (hard  sign)  (see  Note  4) 

y 

y 

'  (soft  sign) 
e 
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Vernacular 

Romanization 

Vernacular 

Romanization 

Nun 

Upper  case  letters 

Lower  case  letters 

K> 

K) 

1U 

M 

!A 

la 

o 

ia 

A 

S 

A 

e 

900 

3 

3 

& 

60 

¥ 

Y 

ps 

700 

0 

F 

Q 

f 

9 

w 

V 

* 

\\ 

v>  y 

y  (v  if  used 
without  dia¬ 
critical  marks) 

400 

X 

Q 

Q 

>A 

LA 

O 

IQ 

\K 

JO 

o 

»Q 

T 

Notes 

G 

h 

g 

Numeric  Value 


1.  Diacritical  signs 


Accents  and  aspiration  marks  are  ignored  in  romanization.  Marks  ignored  in  romanization 
include  the  acute  accent  (  □  ),  the  grave  accent  (  □  ),  the  aspiration  mark  (  u ),  or  combinations 
thereof  (  i),)or(  u'). 


The  letters  T  and  v  are  sometimes  printed  with  two  dots  (or  marks),  as  shown.  In  the  first  case, 
the  two  dots  are  used  whenever  no  other  accent  appears,  and  thus  they  have  no  particular 
significance  and  are  ignored  in  romanization.  In  the  second  case,  the  use  of  the  two  dots  (or 
other  diacritical  marks)  indicates  that  the  letter  is  a  vowel,  and  the  letter  is  romanized  y,  as 
provided  for  in  the  table. 

The  symbol  indicating  a  short  i  (  □  )  is  taken  into  account  in  romanization,  as  indicated  in  the 
table,  resulting  in  the  romanization  i. 

2.  Abbreviations 


Abbreviated  words  are  transcribed  in  full,  without  the  use  of  brackets.  The  most  common 
symbol  for  abbreviation  is  ci .  Sometimes,  omitted  letters  are  placed  above  other  letters  in  the 
word  in  Church  Slavic  texts  (e.g.,  <I>,  e,  v),  the  letter  added  being  sometimes  given  with  a  special 
symbol  (e.g.,  rab).  The  symbol  above  a  letter  indicates  the  omission  of  t>  or  b. 

3.  Numerals 

Numerals  are  represented  in  Church  Slavic  by  letters  (as  shown  in  the  romanization  table)  with 
the  addition  of  the  signs  ^  O  and  .  The  addition  of  an  abbreviation  sign  (□)  indicates 
merely  the  representation  of  a  basic  numeral;  thus  >k  represents  2. 
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Church  Slavic 


Higher  numerals  are  shown  as  follows: 


*b  2,000 

®  20,000 

&  200,000 

2,000,000 

Thus,  the  figure  1913  is  written  ?aufr\ 

4.  Hard  sign 

The  hard  sign  ( t  T> )  is  disregarded  in  romanization  when  it  is  found  at  the  end  of  a  word. 

5.  Dates 

While  Church  Slavic  books  are  usually  dated  in  the  Christian  Era,  they  are  in  some  cases  dated 
by  the  year  of  the  Mundane  Era  of  Constantinople.  To  convert  to  the  western  calendar,  subtract 
5508.  However,  prior  to  1700  A.D.,  the  mundane  year  in  Russia  began  on  September  1st,  so 
that  two  years  must  be  given,  obtained  by  subtracting  5509  and  5008,  respectively.  Thus,  7203 
in  the  Mundane  Era  is  1694/95  in  the  Christian  Era.  In  such  cases,  if  the  month  of  publication 
is  known,  the  year  may  be  determined  by  subtracting  5509  for  September  through  December,  and 
5508  for  January  through  August. 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

/ 

soft  sign  (prime) 

A7 

// 

hard  sign 

B7 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

0 

acute 

E2 

0 

macron 

E5 

0 

breve 

E6 

□ 

dot  above 

E7 

S 

hachek  (caron) 

E9 

0 

ligature,  1st  half 

EB 

IJ 

ligature,  2nd  half 

EC 

□ 

right  hook 

FI 
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Divehi 


Letters  of  the  Alphabet 


Consonants 

/ 

h 

r 

t  (see  Note  4) 

S' 

s  or  h  (see  Note  1) 

* 

1 

r 

n  (see  Note  2) 

j 

g 

y 

r 

e 

n 

0? 

b 

s 

y 

1 

d 

s 

k 

e 

j 

s\ 

’  or  h  or  omit 

/ 

c 

(see  Note  3) 

y 

V 

6 

z 

y 

m 

E 

t 

J 

f 

J 

p 

y 

d 

S' 

y 

Vowels  (see  Note  5) 


A 


s 

9 

Si 

99 

S 


a 

a 

i 

nr 

l 

u 

u 


c 

SI 


cc 

SI 


9C 

SI 


e 

e 

o 

o 

omit  (see  Note  6) 


Divehi  Equivalents  to  represent  Arabic  Letters 


/. 

y 

(-) 

th 

• 

( ) 

d 

(c) 

h 

• 

(J-) 

t 

•• 

y 

(c) 

kh 

• 

z 

y 

( i) 

dh 

Sl 

• 

(6) 

4  (ayn) 

*15* 

(^) 

sh 

Si 

(e) 

gh 

A-/ 

( ^ ) 

s 

•• 

.5 

(*) 

q 

ALA-LC  Romanization  Tables 


page  59 


Divehi 


Notes 


1.  Romanize  as  h,  when  it  doubles  the  following  consonant,  or  is  used  as  a  glottal  stop. 


ahvana 

masah 

u 


«**  ©  * 
rsJ'si 

©  ✓  ✓ 


2.  When  used  in  medial  position  without  (sukun),  romanize  as  in. 


amga 

hariidu 


Zrs 


3.  Romanization  of  . 


(a)  When  used  in  the  initial  position  with  any  vowel  sign,  do  not  romanize. 


ata 

idu 

umuru 

egahugi 


s 

J*Sl 

s' 

3  3  3 
3  ✓  e 


(b) 


When  used  in  the  medial  position  with  any  vowel  sign,  romanize  as  ’. 


ha’hunu 

fa’isa 

kTn 


✓ 


9  © 

& 


(C) 


When  a  consonant  follows 


in  a  medial  position,  double  it  in  romanization. 


cappalu 

appacci 


Am/ 


(d)  When  used  in  the  final  position  with  (sukun),  romanize  as  h. 

•  ae 

boh  -acp 

biheh  s^s'cP 


4.  Romanize  i*  followed  by  another  J*  as  t. 

atteri 


C  9  ^ 


5.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed.  When  the  vowels  follow 
a  consonant,  ^  is  not  used  and  the  vowel  signs  are  added  to  the  consonant  forms.  Do  not 
distinguish  between  the  two  in  romanization. 
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6. 


(called  sukuri),  generally  indicates  omission  of  an  inherent  vowel  associated  with  a  consonant. 
For  its  other  uses,  see  Notes  1,  3,  and  4. 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

( 

ayn 

BO 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

a 

tilde 

E4 

□ 

macron 

E5 

□ 

dot  below 

F2 

0 

double  dot  below 

F3 

□ 

underscore 

F6 

0 

upadhmaniya 

F9 
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Georgian 


Values  are  shown  for  the  older  Khutsuri  and  the  modern  Mkhedruli  alphabets.  There  are  no  upper 
case  letters  in  Mkhedruli. 


Upper  case  letters 


Lower  case  letters 


Khutsuri 


Romanization 


Khutsuri 


Mkhedruli 


Romanization 


c 

A 

•3 

6 

a 

\ 

B 

5 

& 

b 

\ 

G 

3 

a 

g 

D 

S 

£5 

d 

*1 

E 

3 

3 

e 

V 

np 

3 

V 

’b 

z 

% 

*b 

z 

O 

T‘ 

at 

CD 

t‘ 

' \ 

I 

•» 

O 

i 

\ 

K 

k 

3 

k 

L 

TO 

CD 

1 

& 

M 

3 

a 

m 

fi 

N 

K 

6 

n 

a 

O 

« 

n 

o 

P 

•o 

3 

p 

M 

Z 

•s 

3 

z 

R 

A 

(d 

r 

li 

S 

b 

b 

s 

Z 

T 

i 

6 

t 

<X 

U 

mi 

a 

u 

<P 

P‘ 

V 

s 

P‘ 

+ 

K‘ 

* 

i 

£ 

n 

6 

Q 

g 

Q 

H 

3 

q 

a 

§ 

a 

3 

V 

s 

r 

* 

& 

c‘ 

6. 

G‘ 

e 

6 

c‘ 

A 

z 

A 

d 

• 

z 

R 

C 

P* 

V 

c 

S 

e 

S 

3 

>✓ 

c 

Z 

X 

a 

b 

X 

TP 

J 

* 

3 

j 

h 

H 

•m 

3 

page  62 


ALA-LC  Romanization  Tables 


Georgian 


Non-Current  or  Rarely  Used  Letters 

Upper  case  letters 

Khutsuri  Romanization 


F 

V 

£ 

4 


E 

Y 

X 

O 

W 


Lower  case  letters 


Khutsuri 


5 

d 

V 

6 


Mkhedruli 

9 

Q 

3 

8 

i 

%  * 

t 

S  B 

it 


Romanization 

c 

y 

X 

o 

w 

f 

e 

‘  (ayn) 
u 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

< 

ayn 

BO 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

macron 

E5 

□ 

breve 

E6 

□ 

dot  above 

E7 

e 

hachek 

E9 

n 

L_i 

dot  below 

F2 
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Greek 

Greek 

(Also  for  Coptic) 


Vernacular 

Romanization 

Vernacular 

Romanization 

Upper  case  letters 

Lower  case  letters 

A 

A 

a 

a 

B 

B 

3 

b 

(V  in  Modern  Greek; 

(v  in  Modern  Greek; 

see  Note  1) 

see  Note  1) 

r 

G 

Y 

g  (n  before  medial 

7,  k,  X) 

A 

D 

6 

d 

E 

E 

e 

e 

Z 

Z 

S. 

z 

H 

E 

n 

e 

0 

Th 

o 

th 

I 

I 

i 

i 

K 

K 

K 

k 

A 

L 

X 

1 

M 

M 

P 

m 

Mtt 

B 

/in 

b 

(initial  only) 

(initial  only) 

N 

N 

V 

n 

Nt 

D 

VT 

d 

(initial  only) 

(initial  only) 

X 

S 

X 

O 

o 

o 

o 

n 

p 

n 

p 

p 

R 

P 

r 

*p 

Rh  (see  Note  2) 

*P 

rh  (see  Note  2) 

X 

S 

a  (final  <;) 

s 

T 

T 

T 

t 

Y 

Y 

U 

y  (u  in  diphthongs) 

# 

Ph 

V 

ph 

X 

Ch 

X 

ch 

V 

Ps 

Y 

ps 

0 

O 

cu 

o 

Breathing  Marks 

(see  Note  3) 

Mark 

Name 

Romanization 

> 

soft  breathing 

[omit] 

< 

rough  breathing 

h 
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Notes 

1.  The  era  of  the  Modern  Greek  language  begins  with  texts  written  after  1453.  Texts  written  before 
1454  are  considered  Classical  Greek. 

2.  Diacritical  marks  such  as  accents  and  the  dieresis,  and  the  iota  subscript  (  i  )  are  omitted  in 
romanization.  As  the  result  of  a  presidental  decree  in  Greece  in  1982,  monosyllabic  words  are  now 
written  without  accents;  polysyllabic  words  are  written  with  the  acute  accent  (  6.  )  only. 

3.  The  romanization  for  rough  breathing  is  supplied  whether  or  not  the  mark  appears  in  the  Greek 
vernacular  text. 


Numerals 


Characters 

Value 

Characters 

Value 

a' 

1 

p' 

100 

3' 

2 

o' 

200 

y/ 

3 

x' 

300 

6' 

4 

u' 

400 

e' 

5 

<p ' 

500 

?/  or  ax 

6 

x' 

600 

l' 

7 

V 

700 

n' 

8 

800 

d' 

9 

V 

900 

l' 

10 

ia' 

11 

/a 

1000 

et  cetera 

/aa 

1001 

,a3 

1002 

k' 

20 

et  cetera 

KCi/ 

21 

x3' 

22 

/3 

2000 

et  cetera 

/7 

3000 

4000 

X' 

30 

/£ 

5000 

/ 

40 

et  cetera 

v' 

50 

l' 

60 

o' 

70 

7T7 

80 

y 

90 

CHARACTER  MODIFIERS  IN  ROMANIZATION 


Characters 

□ 

0 


Name 


USMARC  hexadecimal  code 

E5 

F6 
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macron 

underscore 
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Gujarati 


Gujarati 


Vowels  and  Diphthongs  (see  Note  1) 


an 

a 

r 

o 

a 

Si 

e 

y 

i 

an 

a 

e 

T 

an 

ai 

<§ 

u 

aHl 

o 

a 

u 

ant 

o 

aHl 

au 

Consonants  (see  Note  2) 

Gutturals  Palatals  Cerebrals  Dentals 


h 

ka 

at 

ca 

i 

ta 

<t 

ta 

kha 

to 

cha 

t> 

tha 

tha 

ai 

ga 

ey 

ja 

d 

da 

f. 

da 

gha 

3 

jha 

d 

dha 

dha 

d 

ha 

ol 

ha 

na 

< 

na 

Labials 


Semivowels 


Sibilants 


Aspirate 


M  Pa 

pha 

m  ba 

<h  bha 

m.  ma 


ya 

*  ra 

<a  la 

la 
va 


*i  sa 

bi  sha 

y  sa 


ha 


Anusvara  Visarga 

(see  Note  3) 


Avagraha 
(see  Note  4) 


□ 


m 


h 


S’ 


(apostrophe) 
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Notes 

1.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed;  the  forms  used  for 
vowels  following  a  consonant  can  be  found  in  grammars;  no  distinction  between  the  two  is 
made  in  transliteration. 

2.  The  vowel  a  is  implicit  after  all  consonants  and  consonant  clusters  and  is  supplied  in 
transliteration,  with  the  following  exceptions: 

(a)  when  another  vowel  is  indicated  by  its  appropriate  sign;  and 

(b)  when  the  absence  of  any  vowel  is  indicated  by  the  subscript  sign  (  v  )  called  halanta  or 
virama. 

3.  Exception:  anusvara  is  transliterated  by: 

n  before  gutturals, 
n  before  palatals, 
n  before  cerebrals, 
n  before  dentals,  and 
m  before  labials. 

4.  When  doubled,  avagraha  is  transliterated  by  two  apostrophes  (  ”  ). 

SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character 

Name 

USMARC  hexadecimal  code 

9 

apostrophe 

27 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

circumflex 

E3 

0 

tilde 

E4 

□ 

macron 

E5 

□ 

dot  above 

E7 

□ 

dot  below 

F2 

0 

circle  below 

F4 
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Hebrew  and  Yiddish 


Hebrew  and  Yiddish 

The  following  romanization  table  attempts  to  represent  the  sound  of  Hebrew  or  Yiddish  words 
but  is  applicable  to  all  Hebraic  languages.  For  Hebrew,  it  approximates  the  modern  Israeli,  primarily 
Sephardic,  pronunciation.  For  Yiddish,  the  table  follows  the  standardized,  principally  Lithuanian, 
pronunciation.  In  romanizing  Yiddish,  the  etymology  of  the  word  is  ignored. 

Consonants 


Vernacular 

Romanization 

Vernacular 

Romanization 

K 

’  (alif)  or  disregarded 

1 

b 

(final  □  ) 

m 

3 

v  (in  Yiddish,  b) 

2  (final  'j  ) 

n 

g 

D 

s 

'T 

d 

V 

‘  (ayn) 

n 

h 

3  (final  ^  ) 

P 

T 

y  (only  if  a  consonant) 

3  (final1!) 

f 

T! 

y  (only  if  a  consonant) 

S  (final  Y  ) 

ts 

T 

z 

P 

k 

n 

h 

T 

r 

i 

sh 

y  (only  if  a  consonant) 

iz; 

a 

s 

3  (final  ^  ) 

k 

FI 

t 

3  (final  ^  ) 

kh 

n 

t  (in  Yiddish,  s) 

Vowels 


In  Hebrew 

Romanization 

In  Yiddish 

Romanization 

□ 

a 

□ 

T 

a  or  o 

N 

a  or  o 

□ 

e 

□ 

e 

u 

0 

• 

i 

• 

□ 

o 

oy 

□ 

••• 

u 

>□ 

V 

e 

i 

>n 

ai 

**□ 

e 

22 

e 

i 

o 

^ 

ay  (if  pronounced  ai 

u 

as  in  aisle,  or) 

n 

i 

e  or  disregarded 

"K ,  •« 

ey  (if  pronounced  ei 

□ 

-i 

a 

as  in  weigh ) 

□ 

VI 

e 

□ 

Tl 

o 
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A  single  prime  (  7  )  is  placed  between  two  letters  representing  two  distinct  consonantal  sounds 
when  the  combination  might  otherwise  be  read  as  a  digraph. 


his 'hid 


TVTDH 


RULES  OF  APPLICATION 

In  romanizing  Hebrew,  it  is  often  necessary  to  consult  dictionaries  and  other  sources  as  an 
appendage  to  the  romanization  table  presented  here,  primarily  for  the  purpose  of  supplying  vowels.  The 
principle  dictionary  used  is  ha-Milon  he-hadash  (Jerusalem:  Kiryat-sefer,  1966-1970)  by  Avraham  Even- 
Shoshan.  More  detailed  instructions  on  romanization  of  Hebrew  and  Yiddish,  including  cataloging 
guidelines,  can  be  found  in  Hebraica  Cataloging  (Washington,  D.C.:  Library  of  Congress,  Cataloging 
Distribution  Service,  1987)  by  Paul  Maher. 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

/ 

soft  sign  (prime) 

A7 

alif 

AE 

C 

ayn 

BO 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

h 

grave 

El 

6 

acute 

E2 

□ 

• 

dot  below 

F2 
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Hindi 


Vowels  and  Diphthongs  (see  Note  1) 


Traditional  Style 

New  Style 

Romanization 

at 

at 

a 

an 

arr 

a 

% 

fat 

i 

i 

aft 

“T* 

1 

z 

at 

>9 

U 

35 

ar 

C\ 

u 

3T 

c 

r 

0 

r 

0 

1 

a? 

* 

e 

ar 

e 

aT 

aT 

A 

e 

ar 

£ 

ai 

* 

£ 

ai 

3ft 

0 

aft 

3ft 

0 

an 

3TT 

A 

O 

aft 

3ft 

au 

aft 

aft 

au 

Consonants  (see  Notes  2  and  3) 


Gutturals 

Palatals 

ka 

ca 

[*’ 

qa] 

cha 

w 

kha 

ar 

ja 

[5T3 

kha] 

[*s 

za] 

n 

ga 

jha 

K 

gha] 

3T 

na 

gha 

[?s 

gfca] 

Z 

na 
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Cerebrals  Dentals 


z 

ta 

ta 

[?3 

ta] 

«r 

tha 

z 

tha 

z 

da 

z 

da 

* 

dha 

z 

ra 

na 

z 

dha 

z 

rha 

ttr 

na 
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Labials 

Semivowels 

Sibilants 

Aspirate 

pa 

ya 

sr  sa 

\  ha 

pha 

T  ra 

tf  sha 

[f  l?a] 

l*3 

fa] 

gr  la 

tr  sa 

5T 

ba 

n  va 

[?3  §a] 

?T 

bha 

ma 

Anusvara 

Anunasika 

Visarga 

Avagraha 

(see  Note  4) 

(see  Note  5) 

(see  Note  6) 

• 

□ 

m 

M/  W  v!/ 

□  n,  m 

:  h 

• 

S’  ’ 

(apostrophe) 

Notes 

1.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed;  the  forms  used  for 
vowels  following  a  consonant  can  be  found  in  grammars;  no  distinction  between  the  two  is 
made  in  transliteration. 

2.  The  vowel  a  is  implicit  after  all  consonants  and  consonant  clusters  and  is  supplied  in 
transliteration,  with  the  following  exceptions: 

(a)  when  another  vowel  is  indicated  by  its  appropriate  sign;  and 

(b)  when  the  absence  of  any  vowel  is  indicated  by  the  subscript  sign  (  N  )  called  halanta  or 
virama. 

3.  The  dotted  letters,  shown  in  brackets  in  the  table,  are  used  in  Urdu  words. 

4.  Exception:  Anusvara  is  transliterated  by: 

h  before  gutturals, 
h  before  palatals, 
n  before  cerebrals, 
n  before  dentals,  and 
m  before  labials. 

5.  Anunasika  before  guttural,  palatal,  cerebral,  and  dental  occlusives  is  transliterated  n.  Before 
labials,  sibilants,  semivowels,  aspirates,  vowels,  and  in  final  position  it  is  transliterated  m. 

6.  When  doubled,  avagraha  is  transliterated  by  two  apostrophes  (  ”  ). 
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SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 

Special  character  Name  USMARC  hexadecimal  code 

’  apostrophe  27 

Character  modifiers  Name  USMARC  hexadecimal  code 

E2 
E3 
E4 
E5 
E6 
E7 
EF 
F2 
F3 
F4 
F5 
F6 


0 

A 

a 

□ 

□ 

□ 

□ 

fi 

□ 

Q 

g 

Q 

□ 


acute 

circumflex 

tilde 

macron 

breve 

dot  above 

candrabindu 

dot  below 

double  dot  below 

circle  below 

double  underscore 

underscore 
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Japanese 


Romanization  System 

The  modified  Hepburn  system  of  romanization  as  employed  in  Kenkyusha’s  New  Japanese-English 
Dictionary  (3rd  and  later  editions)  is  used.  For  the  syllabic  nasal,  n  is  always  used  preceding  b,  m, 
and  p.  Romanization  for  words  of  foreign  (i.e.,  non-Japanese)  origin  follows  the  American 
National  Standard  system  for  the  romanization  of  Japanese;  e.g.,  <  -f  (Betonamu); 

x  h  A  (Vetonamu). 


Word  Reading 

The  reading  of  Japanese  words  follows  standard  Japanese  language  usage,  insofar  as  this  can  be 
determined  from  standard  Japanese  dictionaries.  A  current  modern  reading  is  preferred  to  an 
obsolete  one,  except  where  the  usage  of  standard  authorities  has  established  a  particular  reading 
for  a  particular  name  or  book  title.  The  characters  0  &  are  romanized  as  Nihon  unless  the  usage 
of  standard  authorities  has  established  a  particular  reading;  e.g.,  Dai  Nippon  Teikoku,  Nippon’ichi, 
Nippon  eitaigura,  etc.  If  there  are  various  readings,  the  reading  that  appears  most  frequently  in 
dictionaries  is  used. 


Capitalization 


1.  Personal  Names: 

(a)  Capitalize  each  word  of  a  personal  name,  except  the  particle  no. 

Sugawara  no  Takasue  no  Musume  IT  I?  #  M  3k 


(b)  Capitalize  title  and  terms  of  address,  except  when  consisting  of  a  single  character 
or  kana  for  san,  sama,  chan,  kun,  etc.,  that  is  hyphenated  following  a  personal 
name. 

Kobo  Daishi  ^ 

but  Okiku-san  fc  Ira  &  A 


2.  Place  Names:  Capitalize  each  separately  written  word  of  a  geographic  name. 


Yokohama 
Nihon  Retto 
Yuraku-ch5 
Taiheiyfi 
B5so  Hanto 
Toy  5 


m  m 

□  *  m  g 
m  steir 

J3§  8  ¥  « 

*  # 
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Japanese 


3. 


4. 


5. 


6. 


7. 


8. 
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Corporate  Names:  Capitalize  each  separately  written  word  of  a  corporate  name,  except 
particles  and  conjunctions. 

Senso  o  Kirokusuru  Kai  ^  £  iii!  if  A  b  £ 

Nihon  Rikugun  B  ^  P*  M 


Documents  and  Publications: 

(a)  Capitalize  the  first  word  of  the  title  of  a  publication  (book,  periodical,  series,  etc.). 
Tsurezuregusa  li  ? 

Chuo  koron  ^  A  IL  iw 


(b)  Capitalize  the  first  word  of  the  name  of  a  document  (law,  regulation,  etc.). 
Rod5  kumiaiho  ^  i  I  d  S 

Rodo  iinkai  kisoku  7$  ffl  %  It  ^  8'J 


Historical  Events  and  Periods: 

(a)  Capitalize  each  word  of  the  name  of  a  historical  event. 

Dainiji  Sekai  Taisen  —  '/X  t£  f?  A 

Niniroku  Jiken  ~  •  “  A  ¥ 

Meiji  Ishin  shi  Bfj  7a  IS  $Jt 


« 


(b)  Capitalize  the  first  word  of  the  name  of  a  historical  period. 

Jdmon  jidai  $1  X  ft 

Rikucho  jidai  A  W  ^Fft 

Heiancho  ¥  ^  ^ 

Showaki  Bn  fD  ^ 


Peoples  and  Languages: 
Nihonjin 
Amerikajin 
Nihongo 
Eigo 


Capitalize  names  of  peoples  and  languages. 

B  *  A 

7  * 

B  *  g 

£  m 


Religions  and  Sects:  Capitalize  names  of  religions  and  sects. 

Bukkyo  $  ^ 

Kirisutokyo  4  ')  7  b 

Shinto  ^  3 

Zenshu  ^  ^ 


Derivatives  of  Proper  Names:  Lowercase  words  derived  from  names  of  places  or  religions, 
when  the  derived  words  are  no  longer  considered  to  be  proper  names.  When  the  derivative 
is  formed  by  the  suffix  of  a  single  character  following  a  proper  name,  the  proper  name  is 
capitalized  and  the  suffix  is  lowercased  and  follows  a  hyphen.  (See  Word  Division,  4.  Proper 
Names,  Exceptions ) 

nihonto  B  71 
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nihonshu 

0  *  ii 

nihonga 

0  4*  OS 

butsuga 

n  m 

washitsu 

fp  m 

wafuku 

fa  SH 

yoshu 

n  m 

kutaniyaki 

ASI 

kokutani 

kanji 

m  ^ 

kanpo 

m  * 

kan’yaku 

m  m 

zendera 

m  # 

zenso 

n  it 

kirisutosha 

*  0  7s  Y  # 

Taiwan-sei 

Punctuation 

1.  Transcribe  a  centered  point  (  •  )  used  for  dividing  words  as  a  comma  if  it  makes  the  meaning 
of  romanized  words  clear. 

Chugoku  Shikoku  no  mingei  •£  ES  •  0  i  iO  R  * 

Poru  Kur5deru  —  )V  ■  ?  n  —  t  il 

Matsumoto  Seicho,  Yamamoto  Shugoro  shu  fi1,  ^  if  ill  •  lLi  ^  f?3  £  ^ 

For  such  a  center  point  appearing  between  numbers,  see  Word  Division,  5.  Numerals. 

2.  Transcribe  brackets  (  r  ...  J  )  used  in  the  manner  of  quotation  marks  (  "  ...  "  )  as  quotation 
marks. 


Word  Division 


1.  Sino- Japanese  (on)  Compounds:  A  compound  means  a  word  consisting  of  two  or  more 
Chinese  characters  (kanji),  or  of  Chinese  characters  and  kana,  or  of  kana  alone,  whether 
established  by  dictionary  usage  or  not. 


(a) 


Write  binary  compounds  as  single  words, 
ichigen  ikko 
Rikucho  jidai 
Nihon  kokusei  jiten 
kokumin  shugi 
Indo  tetsugaku  shiyo 
T5yo  Gakkai 

Keio  Gijuku  Daigaku  Keizai  Gakubu 
T5kyo  Daigaku  KySyo  Gakubu 


—  s  —  h 

SR±i 
ep  ®  g  &  £  m 
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(b)  Trinary,  derived,  and  other  compounds. 

(1)  Write  trinary  and  derived  compounds  as  single  words  as  long  as  they  contain  no 
more  than  one  binary  or  trinary  compound. 


keizaiteki 

seibutsugaku 

jinseikan 

A49 

yuibutsuron 

m  In 

kenk5h5 

nmm 

daijinbutsu 

X  XVi 

dai5jo 

jibika 

koseibutsugaku 

hoshakaigaku 

For  a  word  beginning  with  such  characters  as  ,  |B  ,  etc.,  consult  any  current 
dictionary  to  determine  whether  it  is  part  of  a  word  or  is  a  prefix  to  the  following  word 
or  words  (see  Word  Division,  3.  Prefixes,  Suffixes,  etc.  (a)).  If  it  is  appropriate,  apply  the 
provisions  of  (2)  below. 

shinkansen  $ 

kyutaisei  IB  ft 

(2)  Write  trinary  pseudo-compounds  formed  by  the  addition  of  a  single  character  as 
single  words. 


goshiso 

m  s  m 

kakusenso 

kakukazoku 

ryoseikatsu 

shinkenchiku 

daijiten 

daihatsumei 

XftBf) 

If,  however,  a  single  character  is  enclosed  within  brackets  used  as  a  quotation  marks, 
transcribe  the  brackets  as  quotation  marks. 

"jin"  shiso  r  {z  JgS 

"kaku" ronso  r  $  j  m  ^ 


(c)  Write  single  characters  in  succession  constituting  a  pseudo-compound  as  one  word, 
todofuken  $  jg  (ft  $ 

shikuchoson  rfj  g  v j  ft 

shichSson  rf?  IT 

shinokosh5  ±Uj^ 

ishokuju  tL  tt  (i 


page  76 


ALA-LC  Romanization  Tables 


Japanese 


(d)  Hyphenate  grouped  compounds  involving  phonetic  changes. 

jochu-b5k5  & 

bungei-dokuhon  X  ^  St  ^ 

goshi-gaisha 

kabushiki-gaisha  S  #  tt 

but  In  proper  names,  romanize  as  Kabushiki  Kaisha  (e.g.,  Nissan  Jid5sha  Kabushiki 
Kaisha). 


(e) 


Hyphenate  one  or  more  single-character  modifiers 
sho-chugakk5 

sh5-chu-koto  gakk5  'J'  ‘ 

sho-chukibo  kigy5  '1' 

jo-gesuid5 

no-san-gyoson 

n5-k5-kogy5 


having  a  common  substantive. 

*  •  *  ¥5 

*  •  g  ^  ft 

• 

±  •  T*ifi 
m  Lb  Wl  Pi 
m • x • 


bun-shi-tetsugaku  X  •  £  •  S  ^ 

Meiji  sanjushichi-hachinen  Vo  H  +  "b  •  A  ^ 


When  single  character  modifiers  form  a  binary  or  trinary  compound,  however,  follow 
1(a)  or  1(b)  above. 

chushfi  kigyS  ‘l3  'J'  £  al 

Bunri  Gakubu  %  3  ^  Sfl 

rikagaku  jiten  §  It  ¥  5?  A 

doshokubutsu  jikken  li  li  ^  is 

2.  Native  Japanese  (kwi  and  iubakovomi  or  vutovomi )  compounds. 


(a)  Nouns. 

(1)  Write  compound  nouns  as  single  words. 


wareware 

ft  T 

wagahai 

ft  n 

kirisame 

ft  m 

teashi 

*3 

yamatodamashii 

mizusakazuki 

7k  s 

ukiyoe 

chanoma 

it  com 

chanoyu 

^  CO  M 

kokoroarigao 

'L;  M  £S 

iyagarase 

rn  ifi  b  •£ 

kogirei 

'b  %  M 

rikutsudori 

i  m  a  o 
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Write  separately  modifiers  which  are  not  part  of  compounds. 


waga  hokori 

ft  ft  &  0 

waga  musuko 

b  tf  A  =F 

waga  machi 

ft  ifi  ®T 

waga  kyodo 

ft  ifi  «p  ± 

;e  of  doubt,  prefer  the  separate  form. 

waga  kuni 

ft  m  (b  #  m  ) 

waga  ko 

ft  ifi  T 

waga  tomo 

ft*  <*>>&<*  ) 

chichi  haha 

as 

ani  imoto 

ft  i* 

are  kore 

*  tl  Z  tl 

(2)  Write  separately  a  lain  single  character  word  modifying  a  compound, 
onna  ekaki  £  g  fr  % 

aji  jiman  Cfc  tl  tft 

koto  gasso  ¥  a  ^ 

mizu  shigen  &  V® 

kome  sod5  ^  II  fi 

otoko  aite  ^  ^ 


(b)  Verbs. 


(1)  Write  simple  and  compound  inflected  verbs,  with  their  auxiliaries,  as  single  words. 


shihaisuru 

doraibusuru 

yomiuru 

nashienai 

kansuru 

omoidasu 


H  y 


£  ie  r  h 
y  a  rri 
fn.  A-  h 

l  V' 

HtS 
S  V'i  d if 


(2)  Write  verbs  separately  from  adverbs  or  inflected  adjectives  and  verbs. 


do  kangaeru 
aa  shitai 


Z  o  %  Jt  *. 
h  h  L  tz  u 


so  suru 
ko  naru 
tsuyoku  naru 
utsukushiku  naritai 
ikite  ita 
kaette  kuru 
yatte  miy5 
itadaite  ikimasu 


w  o  tc  b 

@  <  &  h 

H  L  <  fc  0  tz  u 

tz 

m  o  t  *  & 

*  o  t  H  «k  o 

a  t 
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(3)  Write  honorific  auxiliaries  or  potential  auxiliaries,  dekiru  and  dekinai ,  separately 
from  other  parts  of  the  verb. 

ookuri  itashimasho 

odekake  asobashimasu  ka 

h  th  8Ht  m  t£  L  t  r  fr 

gaman  dekiru  ka 

»  t ZlhXZfr 

gaman  dekimasen 

t  a. 

Adjectives.  Write  compound  inflected  adjectives  as  single  words. 

bimyonaru 

m  &  %  z 

ikanaru 

£□  ffl  &  Z 

miryokuaru 

\£t>b  Z 

teikonaki 

dodotaru 

^  <7  tz  Z 

osorubeki 

®.  Z 

ayamatta  saho,  ayamariyasui  sah5  ^  "> 

tz  w  &  •  m  o  £  ^  w  a 

Adverbs  and  coniunctions.  Write  compound  adverbs  and  conjunctions  as  single  words. 

tokuni 

&  1C 

narabini 

tz 

tomoni 

ft  tz 

tsuini 

m  tz 

ikani 

3P  t°I  tz 

suguni 

m  <  tz 

matawa 

X  li 

aruiwa 

5Jt  tUi 

(e)  Particles.  Write  particles  separately  from  other  words  and  from  each  other, 
kofuku  e  no  michi  ^  ^  'v  cr>  it 

E  wa  dare  ni  de  mo  kakeru  ^  li  It  (Z  T  i>  fS  (f  Z 

So  iu  hon  o  yomu  no  ga  tanoshii  o 

anata  to  watashi  to  b  &  tz  t  b  tz  L  t 

kumo  no  ue  ni  ®  CD  ±  (c 

anata  no  tame  ni  b  &  t:  <7)  M  lz 

nonki  na  ojisan  nonki  ni  kamaeru  xi  &  d'  $£  §  A  §  xi  In  Ml  £  Z 
yuniku  na  sonzai  a  r.  —  p  %  ft  fa 

Prefixes,  Suffixes,  etc. 

(a)  Write  separately  a  single-character  prefix  modifying  on  or  kun  compounds  following  it. 
zen  shusho  enzetsushu  Iu  M  fg  M  &  H 

ko  shacho  kaikoroku  fit  -K 

shin  okurigana  St  '&  0  Kc 

shin  shokuminchi  shugi  St  If  K  tit  ±  H 

kyu  dotokuritsu  IB  iS  ??  '0 

kyu  dojin  shakai  IB  ±  X  tt  # 
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Dai  jinmei  jiten 

*  A  £  *  * 

sho  bungaku  jiten 

ch5  senshinkoku 

gftit  E 

ch5  genjitsu  shugi 

han  senso  undo 

han  sensoron 

kaku  jidai 

§  83  It 

kaku  todofuken 

§ § 

kaku  musan  seit5 

hi  bunkateki 

n  x  it « 

hi  sabetsu  shakai 

it  %  m  a  # 

ichi  toshokan’in 

ichi  kinen  shashin 

-  IE  3:  M  * 

(b)  Hyphenate  a  single  character  modifying,  or  modified  by,  foreign  words  generally  written 
in  katakana. 


sho-enerugf 

tIj  X  4'  tV 

kaku-enerugf 

datsu-enerugf 

U  x  *  tv  %  - 

sho-ene 

X  * 

enerugf-gen 

x  *  dr'  -  m 

karorF-hy5 

Jj  o  >J  - 

irasutoreshon-teki  ^ 

5^  hk-j/  a 

If  the  foreign  word  in  katakana  together  with  a  single  character  is  a  long-established 
word  or  a  corporate  name,  however,  romanize  it  as  one  word. 

Amerikajin 

7^  U  ^  A 

kirisutosha 

^  'J  A  M 

Saiensusha 

t  ^  x  y  X 

sabisugyo 

■9-  -  t*  X  ^ 

Write  the  suffix  to  or  nado  (  S?  ,  &  b),  hen  (  ,  S3 )  used  for  sections  of  books,  and 

sho  O  ,  tJ')  for  excepts  or  commentaries,  kb  ,  fg)  for  treaties  or  drafts,  and  ten 

(  St)  for  exhibitions,  separately  from  the  word  preceding  them  unless  they  form  Sino- 

Japanese  compounds,  e.g.,  H  .  f 1  ^  > 

?  55  ,  «  »J  1 

kyodo  kiken  koi  to  no  kinshi  ihan 

kyo  konogoro  omoidasu  koto  nado 

t  Ke  b 

senzen  sengo  hen 

Nihon  shokunikushi  sho 

b  *  it  ^  *  # 

Nihon  insho  kb 

B 

Shina  shoshigaku  kb 

£  S5  #  £  ¥  K 

TSzai  bijutsu  koryu  300-nen  ten 

HE  1  H  X  ft  300^  m 
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(d) 


Write  single-character  substantives  modified  by  on  or  kun  compounds  as  part  of  the 
word  preceding  it. 


Ochiboshu 

f&  If 

Kokinshu 

£  ^ 

Kokin  wakashu 

Bunka  jinmeiroku 

X\t 

Nihon  seifu  gyosei  kikozu 

B  *  ft  Iff  It  ft  &  81 

Nihon-Ro  gaikb  junenshi 

Gakko  toshokanh5 

¥  n  m  m 

Kokubunji  shryo  chosa  hokokusho 

Meiji  Taisho  bungakushi 

Kagoshima-ken  fubutsushi 

Ig  ft  A  #  A  to 

shizan  kagakusha 

nm% 4  ^ 

jibi  inkoka 

H  OS  ng 

kyodai  shimaitachi 

R5do  Kijunkyoku 

Asahi  Shinbunsha 

w  b  m  m 

Nihon  Bungaku  Kenkyukai 

B 

m 

m 

% 

if 

m 

a 

m 

£ 

# 

n 

m 

m 


If  the  word  romanized  together  with  a  single-character  substantive  becomes 
meaningless,  hyphenate  it  with  the  word  preceding  it. 

Nihon  gunkoku-shugika  B  4*  W  S3  i  #§  T 

hatten-tojokoku  ^  _t  B 


4.  Proper  Names. 

(a)  Write  proper  names  and  titles  of  books  separately  from  modifiers  or  words  modified 
by  them. 

Rinkan  den  ^  's  i3  —  's  & 

Niwa  Fumio  shu  ^  33  ~X  it  % 

Genji  monogatari  sho  ft  IS  & 

Shin  jidai  ^  ^ 

Min  Shin  jidai  ft  it 

To  shi  sho  ^ 

Nihon  shi  B  t  i 

Beikoku  shi 

Toyo  shiron  S  #  £  iw 


Exceptions: 

(1)  For  proper  names,  including  corporate  names,  that 
contain  other  proper  names,  follow  l(a)-l(b),  2(a),  and 
3(d)  above. 

Edojo  P  ® 

Shijogawara  0  iff  i5!  RF 

Sohokai  If  # 
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Onogumi  E  Ff  IS 

Gendai  Nihonshi  Kenkyukai 

(2)  Write  names  of  historical  periods  with  single¬ 
character  generic  terms  as  single  words. 

Shincho  /f 

Meijiki  f&  M 

(3)  For  names  of  ships,  write  such  prefixes  as  maru  or  go 
separately  from  the  preceding  word. 

Asama  Maru  &  ffl  A 

Hayabusa  G5 

Purejidento  Wiruson  Go  7°  V  i/  t  >  b  •  *7  <  E  V  >  # 

(4)  Hyphenate  single  characters  which  can  be  suffixed  to 

any  proper  names;  e.g.,  &*)  ,  ^  »  S  »  >  ifc  »  &  *  ^ 


0 

#  it  Si! 
B  E  E  It 


Nihon-teki 
Honkon-gata 
Tanaka  Chiyo-shiki 
Hosho-ryu 
Hokkaido-san 
Taiwan-sei 
Tanaka-ha 
Nakasone-kei 
Kanda-bon 
Kanazawa  Bunko-ban 

(5)  Hyphenate  an  auxiliary  to  a  proper  name,  which 
results  in  a  double  consonant. 

KyashT-tte  yonde  —  r>  X  J;  ^  ~C 


3L  E  flu 

dk  M  ifi  g 

£  m 

E  E  % 
E  ©  ffl  * 

£  iR  ~X  H  B5 


(b)  Write  titles  and  terms  of  address  separately  from  personal  names. 

(1)  Write  separately  a  title  that  precedes  a  personal  name. 

Sei  Sabieru  1=  E  t'  x  )l 

(2)  If  a  title  or  a  term  of  address  following  a  personal  name  consists  of  a  binary  or 
trinary  compound,  write  it  separately  from  the  personal  name. 

Meiji  Tenno  V&  3^  M 

Taiken  Mon’in 

Kitashirakawa  no  Miyasama  dk  6  )H  <T>  H  It 

Nichiren  Shoninsama  B  31  AI 

Takezawa  Sensei  Pr  9t  E 


(3)  Hyphenate  a  title  or  a  term  of  address  when  it  consists  of  a  single  character  or 
kana  for  san,  sama,  chan,  kun ,  etc.,  and  follows  a  personal  name. 

Basho-o  **  “  **' 

Kakushin-ni  ^  fa  IS 

Gotoba-in  &  ft  33  ^ 
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Okiku-san 

Nakamura-kun 

Taro-chan 

Non-chan  kumo  ni  noru 
Sa-shi 


&  ^3  $  A/ 

*  an  h  *  a 
y  >  u  i  c  t  ^ 
4  ft; 


(c) 


Geographic  names 

(1)  Hyphenate  generic  terms  used  as  part  of  the  name  of  jurisdictions  or  streets. 


T5kyo-to 

515  £  IP 

Chiyoda-ku 

IMtfflE 

Yuraku-ch5 

m  £  ®t 

Yamaguchi-ken 

iii  □  § 

Yokohama-shi 

Ogawa-machi 

*  JM  U 

Ogasawara-mura 

Exceptions: 

(1)  Write  generic  terms  separately  if  they  form  Sino- 
Japanese  compounds. 

Ogasawara  soncho  '1  ££  -ft 

Ogawa  choshi  T  i'l  ®T  £. 

Chiyoda  Kuritsu  T  ft  EB  E  A 

Yokohama  shisei  rfr 

Tokyo  tomin  ^  Jit  IP  R 

Yamaguchi  kenp5  lLj  □  §  ^ 

(2)  Write  koku  as  part  of  the  name  of  a  country. 

Nihonkoku  B  ^  S 

Manshukoku  ^  ^  ® 

Kankoku  ®  ® 

(3)  Write  Kuni  separately  if  preceded  by  no  in  the 
name  of  a  province. 

Musashi  no  Kuni 


r  m  m  (ft  *  <?>  m  ) 


(2)  Write  modifiers  differentiating  places  of  the  same  name  as  part  of  the  name,  if  they 
are  part  of  the  name  of  a  jurisdiction. 

Higashiizu-cho 

Kamikitayama-mura  _t  db  lLi 


*  f?  s  IT 


If  the  modifiers  are  not  part  of  the  name  of  a  jurisdiction,  write  them  separately. 
Minami  Yamashiro  ^  ill  ^ 

Tonan  Ajia  M  T  ^  T 
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(3)  Write  compound  names  designating  merged  places  or  containing  a  larger  place 
name  as  single  words. 

Ujiyamada-shi  ^  fa  ill  EB  rfi 

Aizuwakamatsu-shi  *  Tfr 

(4)  Hyphenate  generic  terms  for  stations  and  harbors  following  place  names. 

Tokyo-eki  *  £  Sfi 

Yokohama-ko  *e  fit 

If  the  generic  terms  form  Sino-Japanese  compounds,  write  them  separately. 
Tokyo  ekicho  £  m  |R  ft 

Yokohama  kowan  m  M 

(5)  Write  generic  terms  for  geographic  features  as  part  of  the  name. 

Sumidagawa  PS  S  J*l 

Asamayama 

Biwako  K  M  ffl 

Shinanoji  fl  M  Sa 

Saipanto  ^  ^  >  S 

If  the  generic  terms  for  Sino-Japanese  compounds,  write  them  separately. 

Biwa  kohan  IS  11  l¥ 

Asama  sanroku  i#  fW  ill  ^ 


(d)  Abbreviated  Forms 

(1)  Write  words  consisting  of  or  containing  abbreviated  proper  names  as  single  words. 
Nobi  Heiya  ?I  H  ¥  ¥f 

Keihin  kogyo  chitai  W  X  tft  ^ 

Meishin  kosoku  doro  £  $  15  ii  iM 

Shin’etsusen  fl  ^ 

Obeijin  BX  ^  A 

ryuo  gakusei  li  BX  ^  X 

Eishibun  %  &  A 

tainichi  b5eki  W  0  Kl 

Kiki  kayo  sd  <£  IS 

Hyphenate,  however,  a  compound  consisting  of  abbreviated  names  of  countries  or 
languages,  except  when  the  compound  is  normally  elided. 

Nichj-Ro  gaiko  junenshi  B  §1  32  +  ^  £ 

Nichi-Ei-Doku  igo  shojiten  B  $$  J£  f£  H  A  & 

Sen-Man  sosho  $8  H  H 

Nikka  daijiten  B  IXil  ^ 

Nitchu  kankei  B  cp  H 
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(2)  Write  contracted  compound  proper  names  as 
Chuky5  juy5  bunkenshu 
Mantetsu  chiho  gyoseishi 
Soren  no  Nihon  kenkyu 
Hokushi  Jiken  no  keika 
Sa5  monogatari 
FukuS  hyakuwa 

5.  Numerals 

(a)  Write  cardinal  numbers  under  100,  and  the  numbers  100,  1,000,  10,000,  and  100,000, 
etc.,  as  single  words,  if  spelled  out.  Separate  by  hyphens  the  hundreds,  thousands,  tens 
of  thousands,  etc.,  in  numbers  over  100. 

sen-kyuhyaku-hachijusannen  f  A  I  t  r  $ 

(b)  Write  the  ordinal  prefix  dai,  numerators,  and  other  suffixes  as  part  of  the  numbers  they 
precede  or  follow,  if  spelled  out.  Write  them  separate  from  the  words  they  modify. 

daisan  seiryoku  M  ti 

Dainiji  Sekai  Taisen  kaikoroku 


single  words. 

mmx mm 

S  ft  t|  (t  K  i 
V  &  CO  0  2k  5Jf  £ 

a  x  ¥  ft-  eo  m.  i§ 

&  $  v>  S3 
a  m  ws 


(c) 


Transcribe  a  center  point  between  numbers  as  a  period,  with  no  space  following  it.  If, 
however,  the  number  is  spelled  out  in  romanization  as  a  word,  then  the  center  point 
is  not  represented  by  a  Roman  value  (cf.  Library  of  Congress  Rule  Interpretations, 
Appendix  C.4C). 


8.15  zengo  :  senso  to  watakushitachi 
3.14  fut5  hanketsu 
20.5-seiki  no  ongaku 
but  Niniroku  Jiken 


8-  15H  £  :  m  &  t  fl  fc  % 
=.  ■  —  23  T'  '-M  £'J  iJt 
20-5tM£?)#& 


(d)  Hyphenate  numbers  joined  to  modify  a  common  substantive. 
3-4-jigen 

Showa  58-59-nendo  5  A 


3-4&7L 

E  A  ^  IS 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character 

Name 

USMARC  hexadecimal  code 

y 

apostrophe 

27 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

a 

macron 

E5 
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Javanese,  Sundanese,  and  Madurese 


1.  Principal  consonants1 

cum 

urn 

(h)a2 

(Ul 

^ji 

pa 

CKl 

d 

na 

QJ1 

CJ 

dha 

001 

o 

ca 

OK 

ja 

in 

in 

ra 

cum 

cum 

ya 

(KTl 

ka 

conn 

A 

nya3 

cm 

da 

CEJl 

0 

ma 

asm 

QSV 

ta 

nnn 

nnn 

ga 

0J1 

sa 

con 

on 

ba 

o 

0 

wa 

op 

GJ 

tha 

arm 

cn. 

la 

ca 

ca 

nga 

2.  Other  consonant  forms4 

OK 

nya 

coin 

na 

(na) 

crun 

ga 

(gha) 

ca 

(cha) 

ba 

(bha) 

cinnn3 

ra 

cum 

Jin 

ha 

CKV 

ka 

(kha) 

(KTl 

kha 

(Vi 

4 

ta 

(tha') 

• 

•  • 

cui 

«• 

^ji 

fa 

cm 

sa 

(£a) 

• 

•• 

OK 

za 

cuu 

sa 

(ja) 

am 

»*• 

cinn 

gha 

cu 

pa 

(pha) 

ca 

• 

•• 

on 

*a 
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3.  Vowels  and  other  agglutinating  signs5 


Vowels 

Other 

26 

&A J 

a 

\ 

h 

"1 

f 

"1 

e 

r 

o 

o 

2 

g 

ra 

Q 

Q 

can 

s 

i 

OJl 

s 

r£ 

nrj  2 

(Tj  2 

9 

9 

lg 

b 

t> 

9 

u 

A 

ys? 

ng 

4.  Numerals 

arm 

i 

So 

CM 

3 

(T  4  <3| 

5 

6  arm 

7  (L3i 

8 

cuui  9  o 

0 

Notes 

1.  Each  consonant  has  two  forms,  the  regular  and  the  appended,  shown  on  the  left  and  right 
respectively  in  the  romanization  table.  The  vowel  a  is  implicit  after  all  consonants  and 
consonant  clusters  and  should  be  supplied  in  transliteration,  unless: 

(a)  another  vowel  is  indicated  by  the  appropriate  sign;  or 

(b)  the  absence  of  any  vowel  is  indicated  by  the  use  of  a  paten  or  pangkon  sign  (  J| ). 

2.  This  character  often  serves  as  a  neutral  seat  for  a  vowel,  in  which  case  the  h  is  not 
transcribed.  Generally  speaking,  ha  in  word-initial  or  vowel-medial  position  in  a  root  word, 
is  romanized  without  the  h.  Root  word-final  ha  followed  by  suffixal  vowels,  on  the  other 
hand,  is  always  romanized  with  h.  When  questions  arise  as  to  whether  the  h  should  be 
represented,  consult  a  standard  dictionary  of  Javanese  in  Latin  script,  especially  Pigeaud, 
Poerwodarminto,  or  Horne. 

3.  The  letter  nya  before  the  letters  ja  and  ca  should  be  romanized  as  n  not  ny,  as  in  the  words 
panca,  prenjak,  and  so  forth. 
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4.  "Other  consonant  forms"  refers  to  aksara  murda  ("on  the  left")  and  aksara  rekan  ("on  the 
right").  The  former  are  similar  to  capital  letters  and  have  an  honorific  effect  which  is  not 
preserved  in  standard  romanization.  The  aksara  rekan  are  used  to  indicate  phonemes  alien 
to  Javanese,  particularly  in  words  or  Arabic  and  Dutch  origin.  Nonce  forms  or  idiosyncratic 
usages  may  also  be  encountered.  These  are  best  romanized  in  accordance  with  the  spelling 
of  the  foreign  word  intended.  In  words  of  Old  Javanese  origin,  the  same  characters  represent 
aspirated  or  other  consonants  and  should  be  romanized  with  the  alternative  equivalents 
provided  in  parentheses. 

5.  Vowels  are  almost  always  indicated  by  one  of  a  class  of  agglutinating  signs  ( sandangan ) 
added  above,  below,  before,  or  after  the  consonant  or  consonant  cluster  which  they  affect. 
Other  signs  are  used  to  indicate  the  semi-vowels  r,  /,  and  y,  as  well  as  the  consonants  h  and 
ng,  when  they  occur  in  certain  positions  within  a  syllable.  Free  standing  vowels  (shown  to 
the  right  in  romanization  table)  are  rare,  usually  occurring  in  the  initial  position  in  words  of 
foreign  origin.  No  difference  between  vowels  indicated  by  free  standing  characters  and  those 
represented  by  agglutinating  signs  is  preserved  in  romanization. 

6.  The  various  dirga  marks,  which  in  Old  Javanese  represented  long  vowels,  are  used  as 
colometric  devices  in  New  Javanese.  They  are  not  represented  by  macrons  in  romanization. 

7.  Sometimes  this  character,  the  a-kara,  is  used  as  a  neutral  seat  that,  when  marked  with  the 
appropriate  sign,  can  also  be  transliterated  as  i,  u,  etc.  Thus  is  romanized  i,  and  so 
forth. 

8.  Penultimate  o  in  an  open  syllable,  when  the  final  syllable  is  also  open,  is  romanized  as  a,  as 
in  the  words  panca,  angka,  rangga  and  so  forth. 

9.  This  character,  the  ya-pengkal,  is  used  in  consonant  clusters  within  words.  Consonant 
clusters  between  words  formed  when  the  second  word  begins  with  y  use  the  ordinary 
appended  form  of  ya.  In  romanization  these  two  forms  are  not  distinguished. 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character 

Name 

USMARC  hexadecimal  code 

* 

apostrophe 

27 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

o 

tilde 

E4 

□ 

breve 

E6 

□ 

dot  below 

F2 
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Kannada 


Vowels  and  Diphthongs  (see  Note  1) 


e 

a 

°2 

1 

o 

w 

a 

•0 

e 

« 

i 

to 

e 

3s 

T 

1 

SO 

ai 

eno 

U 

2o 

o 

erus 

u 

to 

5 

si) 

r 

o 

SB 

au 

sjjs 

r 

o 

Consonants  (see  Note  2) 

Gutturals  Palatals  Cerebrals  Dentals 


ka 

23 

ca 

0 

ta 

3 

ta 

so 

kha 

d 

cha 

d 

tha 

tha 

ri 

ga 

as 

ja 

d 

da 

d 

da 

3* 

gha 

dp 

jha 

d 

dha 

d 

dha 

a 

ha 

•sr 

ha 

na 

na 

Labials 

Semivowels 

Sibilants 

Aspirate 

3j 

Pa 

cdo 

ya 

t>  sa 

a5  ha 

pha 

d 

ra 

35  sa 

20 

ba 

86 

ra 

A  sa 

d 

bha 

o 

la 

SdO 

ma 

s? 

la 

CO 

la 

ij 

va 

Anusvara  (Bindu 
or  Sonne) 

(see  Note  3) 

o  m 


Visarga 


h 
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Notes 

1.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed;  the  forms  used  for 
vowels  following  a  consonant  can  be  found  in  grammars;  no  distinction  between  the  two  is 
made  in  transliteration. 

2.  The  vowel  a  is  implicit  after  all  consonants  and  consonant  clusters  and  is  supplied  in 
transliteration,  with  the  following  exceptions: 

(a)  when  another  vowel  is  indicated  by  its  appropriate  sign;  and 

(b)  when  the  absence  of  any  vowel  is  indicated  by  the  superscript  sign  (  ) 

3.  Exception:  Anusvara  is  transliterated  by: 

h  before  gutturals, 
h  before  palatals, 
n  before  cerebrals, 
n  before  dentals,  and 
m  before  labials. 


CHARACTER  MODIFIERS  IN  ROMANIZATION 


Character  modifiers 

Name 

USMARC  hexadecimal  code 

* 

□ 

acute 

E2 

□ 

tilde 

E4 

□ 

macron 

E5 

□ 

dot  above 

E7 

□ 

dot  below 

F2 

0 

double  dot  below 

F3 

5 

circle  below 

F4 

□ 

underscore 

F6 
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(in  Perso-Arabic  Script) 

Letters  of  the  Alphabet 
Consonants 


Initial 

Medial 

Final 

Alone 

Romanization 

-J 

-J* 

b 

mJ 

*- 

U 

P 

J 

U. 

CL*. 

CL 

t 

u 

la 

la 

la 

t 

e-J 

J 

-L 

L. 

CL 

s. 

2? 

C- 

c 

j 

s 

c 

c 

h 

.5*. 

c 

kh 

X 

d 

la 

la 

Jo 

i- 

d 

i 

i. 

i 

z 

j 

V 

j 

r 

la 

Ja 

Jo 

4o 

-J 

J 

r 

J 

3~ 

J 

z 

J 

-> 

3- 

3 

ts 

-ajU 

-ul. 

o— 

s 

Jli 

— Lu. 

u- 

£ 

s 

-*£)» 

u3- 

u63 

s 

kJ^- 

u*3 

z 

JL 

Lx. 

Lx. 

L> 

1 

L, 

Lx. 

Lx. 

Lx 

z 

X 

A 

t 

‘  (ayn) 

X 

A 

e- 

i 

gL 

J 

jL 

f 

-a 

.L 

<3 

q 

.L 

^  j  d 

k 

g 

J 

1 

J- 

j 

i 

r^- 

p 

m 

J 

_*. 

o~ 

o 

n 

J 

> 

3~ 

3 

V 

■& 

o  <  <*. 

& 

h 

-J 

7*- 

C- 

C- 

y 
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Vowels 


Initial 

Medial 

Final 

Alone 

Romanization 

*•* 

1 

□ 

□ 

1 

a 

T 

L 

p 

L 

p 

T 

a 

1 

□ 

□ 

1 

a 

1 

1 

1 

1 

a 

• 

1 

□ 

-X* 

□ 

1 

• 

1 

v' 

7D 

crO 

c5l 

r- 

1 

1 

p 

□ 

P 

□ 

P 

i 

u’ 

J 

□ 

□ 

J 

u’ 

r 

i 

□ 

9 

□ 

r 

1 

u 

Si 

So 

So 

S' 

u 

ii 

JO 

io 

3* 

0 

^0 

-?l 

5 

^0 

^0 

^1 

9 

- 

- 

Tjl 

5 

• 

2? 

e 

j 

-*□ 

i 

e 

- 

5-0 

v5° 

i 

y  (see  Note  1) 

Digraphs  Representing  Kashmiri  Aspirates 


ph 

At 

th 

A^ 

th 

4s 

ch 

4? 

kha 

4^ 

tsh 

Ay 

Notes 


1. 


This  sign  palatalizes  the  preceding  consonant  and  in  the  medial  position  is  at  times  followed  by 
another  vowel,  e.g.  a  (  □  )  or  e  (  ^  ). 

apyar  J**l 


When  the  short  vowel  e  (  )  follows  a  palatalized  consonant,  replace  it  by  0 . 

khyen  Cm4^ 
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RULES  OF  APPLICATION 

Letters  Which  May  Be  Romanized  in  Different  Ways  Depending  on  Their  Context 

1.  Romanize  the  Arabic  article  Jl  as  follows: 

(a)  When  the  article  is  prefixed  to  a  word  beginning  with  a  "moon  letter"  (l‘V‘C‘C‘C‘6 

romanize  it  as  al. 

al-Quran 

(b)  When  the  article  is  prefixed  to  a  word  beginning  with  a  "sun  letter"  (  Cj  «  Cj  *  ^  ‘  ^  «  J  4 

•I3  <  <  J  <  u  ),  replace  the  /  of  the  normal  Roman  value  with  a  repetition 

of  the  letter  of  digraph  that  begins  the  romanization  of  the  word  to  which  the  article  is  prefixed. 

an-nur  JA  i 

(c)  When  the  article  precedes  the  second  noun  of  a  gentive  phrase  ( idafah ),  follow  the  preceding 
directions  for  sun  and  moon  letters,  but  substitute  the  final  short  vowel  of  the  Roman  value  of 
the  first  noun  for  the  a  of  the  normal  Roman  value  of  the  article.  Romanize  such  two-word 
genitive  phrases  as  single  words. 

Muhmddm 

2.  Represent  £3  ( shaddah  or  tashdui)  by  doubling  the  Roman  letter  or  digraph  concerned. 

Muzaffar 


□  ( jazm )  or  &  ( sukun )  indicate  the  absence  of  a  vowel  following  the  letter  over  which  it  is  written. 
It  is  not  represented  in  romanization. 


rong 


4.  The  vowel  sign  □  is  often  omitted  in  writing,  but  supply  its  Roman  value  a  in  romanization. 


Romanization  of  Foreign  Words 

Foreign  words  in  a  Kashmiri  context  are  romanized  according  to  the  present  table  and  accompanying 
rules.  This  principle  covers  all  languages,  including  those  also  written  in  the  Perso-Arabic  script 
(e.g.,  Persian  or  Urdu).  The  following  are  specific  guidelines,  necessarily  derived  from  or  related 
in  some  way  to  this  principle. 

1.  Romanize  jd  and  I5O  (or  )  as  au  and  ai ,  respectively,  when  they  are  pronounced  as  diphthongs. 

2.  Romanize  ,3  (alif  maqsurah )  as  a  when  it  is  used  in  place  of  I'  to  represent  the  long  vowel. 

Tsa  Jr^rt-Z 

Musa 
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3.  Romanize  tanwm  (written  □ ,  □  (l§  ),  or  9  )  as  un,  an,  and  in,  respectively,  when  it  occurs  in  a  word 
or  expression  borrowed  from  Arabic.  Otherwise,  do  not  romanize  it. 

andazan 

‘amalan 


4.  Regardless  of  pronounciation,  romanize  u  (undotted  form  of  the  letter  O  )  as  n. 

Jammun 

jahan 


5.  Romanize  *  ( hamzah )  as  ’  ( alif)  when  medial  or  final. 

assahada  ’ 

aTnah 

mu’min 


£  I  -.ill 


6.  When  *  ( hamzah )  represents  the  connective  syllable  joining  a  mufaz  to  what 

Idarah-yi  Haft  ... 


follows,  it  is  romanized  -yi. 


7.  Izafat. 


(a)  When  the  muzaf  (the  first  of  two  words  in  a  grammatical  relationship  called  izafat )  ends  in  a 
consonant,  add  -i  to  it  in  romanization. 

Tarlkh-i  Hindustan 


(b)  When  the  muzaf  ends  in  a  vowel  or  a  silent  a  ,  add  -yi. 

darya-yi  sor 

Malikah-yi  Inglistan  <^lo 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

/ 

soft  sign  (prime) 

A7 

alif 

AE 

< 

ayn 

BO 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

macron 

E5 

□ 

dot  above 

E7 

□ 

dot  below 

F2 

0 

w 

double  dot  below 

F3 

□ 

underscore 

F6 
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Khmer 

Consonants 


Full  Form 

Subscript 

Romanization 

Full  Form 

Subscript 

Romanization 

n 

□ 

n 

k 

5 

□ 

d 

2 

□ 

6) 

kh 

/vs 

U 

□ 

dh 

n 

□ 

n 

8 

a 

□ 

«*■ 

n 

m 

B 

gh 

u 

DJ 

P 

6 

□ 

c/ 

• 

n 

tS 

s 

ph 

u 

2 

c 

n 

□ 

tl 

b 

s* 

□ 

ch 

T\ 

□ 

r\ 

bh 

□ 

j 

h 

□ 

-b 

m 

fUl 

SI 

jh 

m 

DJ 

y 

0 

□  or  □ 

m 

n 

i 

(? 

r 

□ 

ft 

t 

fU 

□ 

oj 

1 

u1 

□ 

ct 

th 

f 

□ 

V 

* 

□ 

7 

d 

» 

□ 

a 

✓ 

s 

m 

R] 

dh 

tf 

DJ 

s 

fin 

□ 

run 

n 

M 

s 

n 

□ 

0 

t 

in 

0 

LTI 

h 

ti 

□ 

c 

th 

- 

1 

H 

D 

q 

Vowels 

Independent  Romanization 

Independent 

Romanization 

i 

(3  or  § 

o 

t- 

l 

5  or  d 

au 

u 

y 

r 

o 

^  or 

3i  u 

a 

r 

o 

b 

e 

D 

1 

o 

0 

ai 

a 

r 

o 
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Vowels 


Dependent 

Romanization 

Dependent 

Romanization 

□ 

□a 

e 

□ua 

□  □ 

□aD 

tH 

□oe 

□  □ 

□  an 

□  ya 

O' 

□  □ 

□  an 

toj 

□ia 

tn 

□a 

tn 

□e 

tf)D 

□  aQ 

tn 

□ae 

a 

□i 

iu 

□ai 

a 

□r 

tal 

□o 

a 

□y 

tnV 

□au 

a 

□y 

o 

□ 

□am 

□ 

* 

□u 

u% 

□ah 

□ 

u 

□  u 

□: 

□  a 

Diacritical  Marks 


Vernacular 

a 


□ 

□ 

□ 

+ 

□ 


□ 

A 

Notes 


Alternative 

9 

9 


Romanization 
u  (hard  sign) 

/  (soft  sign  (prime)) 
rn 

□  °  (circle  above) 

□  ’  (alif) 

□  ‘  (ayn) 

□ '  (dot  above) 


1.  In  the  consonant  portion  of  this  romanization  table,  the  special  character  □  shows  the  position 
of  a  Khmer  script  character  below  which  a  subscript  character  is  written.  A  subscript  character  is 
always  romanized  after  a  full  form  character,  without  an  intervening  vowel,  as  in  (n|n  (krakhvdk). 


2.  When  0  (h)  occurs  with  a  subscript  character,  the  lower  element  is  omitted,  as  in  tin  (nj).  When 
0  occurs  as  its  own  subscript,  it  takes  the  full  form  0,  as  in  fiQT)  ( kahha ).  Otherwise,  the  subscript 
has  the  form  of  the  lower  element  alone,  as  in  2  ( khh ). 


3.  The  consonant  U  (p),  followed  by  the  vowel  1  (a),  takes  the  special  form  Cp. 
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4.  In  the  vowel  columns,  □  shows  the  position  of  the  consonant  relative  to  the  vowel.  This  applies 
to  both  the  Khmer  vernacular  and  to  the  romanization  columns.  It  should  be  noted  that  □  in  the 
Khmer  vernacular  column  can  also  represent  a  final  consonant  with  no  vowel  following,  in  which 
case  it  is  romanized  simply  as  □  ,  as  in  Sn  (dab). 

5.  The  consonants  °  (m),  and  %  (h)  are  always  preceded  by  a  vowel,  but,  being  finals,  never 
themselves  bear  a  vowel.  Vowels  other  than  a  may  precede  them,  as  in  (turn),  U\1S  (seh). 

6.  The  diacritics  ”  and  ~  are  romanized  by  "  and  1  respectively,  immediately  following  the 

consonant  they  modify.  They  have  the  alternative  form  when  they  co-occur  with  one  of  the 

superscript  vowels  **,  and  **  .  When  0  co-occurs  with  one  of  the  superscript  vowels  and 
with  one  of  the  consonants  n,  0,  U,  tU,  J,  f ,  or  U,  it  is  romanized  as  "  ,  as  in  U  (p"i ).  When  □ 
co-occurs  with  one  of  the  superscript  vowels  and  with  one  of  the  consonants  W,  tfl,  or  H,  it  is 
romanized  1  ,  as  in  $  (s'l).  Otherwise,  □  represents  the  vowel  u,  as  in  US  (muii). 

7.  The  diacritics  □ 0 ,  □  * ,  □  ‘ ,  and  □  '  in  the  romanization  column  are  placed  after  the  last  letter  of 
the  word  in  which  they  occur,  as  in  nj[n  tlf  (ksatriy  );  lj)  S  ( cdh ’);  $  (ta ');  JTinS  (qatman  ). 

8.  Conventional  signs  are:  3  ,  romanized  by  repeating  the  preceding  word  or  phrase;  7HJ7  romanized 

as ./.  ;  7U7  ,  romanized  as  .p.  ;  _  ,  romanized  by  means  of  a  hyphen  ( -  );  I ,  romanized  by  means 

of  a  colon  (  :  ),  and  7  and  7/  ,  romanized  by  means  of  a  period  (  .  ).  The  signs  ^  and  6w*  are 

omitted  in  romanization. 

9.  The  numerals  are:  o  (0),  «  (1),  \o  (2),  sn  (3),  6  (4),  <sf  (5),  (6),  <r)  (7),  d  (8),  and  (9). 

10.  Khmer  words  are  not  written  separately  and  spacing  occurs  only  after  longer  phrases.  When 

romanizing,  the  shortest  written  form  which  can  stand  alone  as  a  word  is  treated  as  such.  This 
applies  also  to  Pali  and  Sanskrit  loan-words.  Other  loan-words  are  divided  as  the  original  language. 

SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

/ 

soft  sign  (prime) 

A7 

J 

alif 

AE 

( 

ayn 

BO 

// 

hard  sign  (double  prime) 

B7 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

h 

grave 

El 

6 

acute 

E2 

□ 

circumflex 

E3 

□ 

tilde 

E4 

□ 

macron 

E5 

□ 

breve 

E6 

□ 

dot  above 

E7 

fi 

circle  above 

EA 

□ 

• 

dot  below 

F2 

0 

circle  below 

F4 
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Romanization 


1.  The  Library  of  Congress  will  continue  to  follow  the  McCune-Reischauer  system  to  romanize 
Korean.  See:  Romanization  of  the  Korean  Language:  Based  upon  its  Phonetic  Structure  by  G.M. 
McCune  and  E.O.  Reischauer  ([S.l.  :  s.n.,  1939?]),  reprinted  from  the  Transactions  of  the  Korea 
Branch  of  the  Royal  Asiatic  Society. 

2.  The  Library  of  Congress  will  designate  certain  standard  dictionaries  as  final  authorities  to  resolve 
questions  of  contemporary  pronunciation.  A  word  will  be  considered  to  be  pronounced  as 
indicated  in  those  dictionaries,  and  romanized  in  such  a  way  as  to  represent  its  pronunciation  most 
accurately. 


3.  When  romanization  rules  conflict  with  the  pronunciation  of  word,  prefer  to  represent  the 
pronunciation.  Do  not  romanize  silent  syllabic  finals, 
choun  ^  ^ 

yori 

.  I  ca 

namunnip  r  ^ 

kap  ^ 

opta  ^ 


(or  2  b|  ) 


4.  Sai  siot.  Follow  McCune-Reischauer  directions  on  romanization  of  medial  (sai)  siot,  with  these 
revisions: 

Romanize  as  nn  when  a  syllabic  final  before  i  and  yotized  vowels, 
yenniyagi  °|  <4  7 1 

honnil  3  °J 

Romanize  as  d  when  a  syllabic  final  before  all  other  vowels, 

modopta  ^  ^ 

udot  %  ^ 


Represent  a  reinforced  medial  consonant  as  it  is  pronounced,  regardless  of  written  form.  (NOTE: 
Some  dictionaries  represent  a  reinforced  medial  consonant  with  a  double  consonant:  -I  21  I  —  al 
However,  the  romanization  would  not  necessarily  show  a  double  consonant:  uikwa) 

Hancha  ^  ^  {  &  ^  } 

but  hanja  (a  measure)  ^  ^ 

silssi  ® 

yL 

p‘yongka  °  f 

munpop 
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6. 


Words  written  with  double  final  consonants  are  to  be  romanized  as  they  are  pronounced, 
huk  en 


sam 

tak 

Maksu 

Marksu 


□j-  /V 
$  — 


7.  Initial  niun  (»-)  and  liul  (s ).  Follow  McCune-Reischauer  rules  governing  initial  niun  and  liul,  with 
the  following  exceptions.  The  surname  ^  is  always  romanized  Yi,  no  matter  how  it  is  written  ( 

°l  >  al  )• 


nodong 

yoksa 

Yi  Sung-man 


3  A 


□(- 
o  l_L 


(or  ic  #  ) 
(or  ^  a(-  ) 
(or  o|  °t) 


Exception  1)  When  medial  nitin  is  followed  by  medial  liul,  they  are  generally  romanized  11. 
However,  when  syllables  beginning  with  f  or  ^  (i.e.J|-  ,  i! ,  ?j)  follow  a  vowel  or  medial  niun, 
the  sound  of  liul  is  generally  dropped.  In  such  cases,  liul  is  not  romanized. 
mullihak  ^ 

nayol  51  5!l 

punyol 
but  Koryo 


m 


9s  ® 


r^) 


Exception  2)  To  accommodate  Word  Division  Rule  1,  particles  beginning  with  the  letter  liul  are  to 
be  separated  from  other  words,  and  are  to  be  romanized  beginning  with  the  letter  r  in  all  cases. 


sae  yoksa  rul  wihayo 

B  *  f-  *\ 

charip  kyongje  ro  ui  tojon 

£  iL  e  S?  JE. 

£IE  WL 

Kim  Mari  ranun  puin 

±  El  A  t 

A  A 

8.  Following  hyphens: 

(a)  When  sounds  would  normally  change,  according  to  McCune-Reischauer  rules,  sound  change 
is  indicated  following  a  hyphen  in  the  following  instances: 

(1)  As  second  syllable  of  a  forename. 

Exception:  When  the  second  part  of  a  given  name  follows  a  vowel  sound  and  begins  with  a 
yotized  initial  liul  ( e ),  the  medial  r  is  omitted  (unless  there  is  convincing  evidence  that  the 
medial  r  is  intended  to  be  pronounced.) 


Song,  Si-yol 

£ 

Yi,  Ki-y5n 

El 

A  £ 

Kim,  Ch‘or-won 

£ 

An  Ung-nyol 

£ 

m 
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(2)  For  all  generic  terms  used  as  jurisdictions,  except  the  term  pukto : 

Kangwon-do  Vl  I?  il 

Wando-gun  *1-  s.  -E- 

(b)  When  sounds  would  normally  change,  according  to  McCune-Reischauer  rules,  sound  change 
is  not  indicated  following  a  hyphen  in  the  following  instances: 

(1)  In  a  spelled-out  cardinal  number: 

ilch‘on-kubaek-yuksipp‘al  —  ft 

(2)  Between  a  numeral  and  volume  designation: 

che  3-chip  3$ff 

4-kwon  4^ 

(3)  Between  a  year,  written  in  numerals,  and  a  suffix  or  modifier: 

10-chunyon  10il§ 

1900-yon  1 900^ 

(4)  Between  abbreviated  forms  combined  coordinately: 

chung-kodung  hakkyo  iH  ^  15 

(5)  For  the  generic  jurisdiction  term  pukto: 

Ch‘ungch‘ong-pukto  M  Jh  iM 

(6)  When  a  word  of  Western  origin  is  modified  by  a  single  character  modifier,  affix,  or 
substantive  of  Korean  or  Sino-Korean  origin: 

esei-chip  oj|  x^|  o|  *J 


9.  Words  of  Western  Origin 

(a)  The  letters  that  can  be  used  in  romanizing  words  of  Western  origin  are  limited  to  those 
allowed  by  the  McCune-Reischauer  romanization  rules.  Therefore,  while  there  may  be  a 
choice  between  letters  that  may  be  used  to  represent  a  consonant,  there  can  be  no  variation 
in  the  representation  of  vowels.  The  letters  that  may  be  used  to  represent  consonants  are: 
k,  g,  ng 
n,  1 

c  t,  ch,  j,  d 

a  1,  r,  n 

Q  m 

*  P,b 

^  s,  sh,  n,  t,  d,  p,  k 

°  ng 

*  ch,  j 

*  ch 4 

^  k‘ 

E  t\ch 

“  P‘ 

*  h 
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(b)  When  one  or  more  of  the  prescribed  letters  corresponds  exactly  to  its  counterpart  in  the 
Western  word  being  romanized,  that  letter  will  always  be  used. 

dijain  (from  the  word  design)  cj  xf  °] 

Lenin  gj|  l| 

(c)  When  none  of  the  prescribed  letters  corresponds  exactly  to  its  counterpart  in  the  Western 
word  being  romanized,  apply  the  romanization  system  strictly,  without  approximation  of 
sounds. 

Pet‘unam  uj|  je.  y- 

(from  the  word  Vietnam-,  the  romanization  system  does  not  permit  the 

use  of  the  letter  v) 

chero  x||  5. 

(from  the  word  zero-,  the  romanization  system  does  not  permit  the  use 
of  the  letter  z) 

(d)  When  the  original  Western  word  cannot  be  ascertained,  apply  the  romanization  system 
strictly. 


Word  Division 

Basic  Principles: 

1.  Each  word  or  lexical  unit  (including  particles)  is  to  be  separated  from  other  words. 

k‘alla  TV  rul  chungsim  uro  4  TV  fl-  rf:  2  3. 

2.  The  Library  of  Congress  will  designate  certain  standard  dictionaries  as  final  authorities  to  resolve 
questions  of  word  division.  A  word  found  in  these  sources  will  be  considered  to  be  a  lexical  unit, 
and  written  as  a  unit.  Words  or  parts  of  speech  not  appearing  in  these  authorities  will  be  separated 
or  connected  according  to  these  guidelines.  Then,  when  in  doubt,  prefer  to  separate. 

3.  A  compound  word  is  considered  a  combination  of  binary  words.  (A  binary  word  is  a  compound 
consisting  of  two  Chinese  characters  ( Hancha )  or  two  syllables  of  Han’gid.) 

4.  A  compound  word  is  divided  into  binary  components  according  to  euphony. 

Specific  Rules: 


Write  a  particle  ( JE)  as  a  word  separate  from 
below. 

na  nun  nae  kil  e 
noin  kwa  pada 

ch'owon  ui  kkum  ul  kudae  ege 
sae  ya  sae  ya  p'arang  sae  ya 


the  word  stem,  except  as  noted  in  (a)  through  (d) 

4  t:  lH  7A  oil 

=£  k  4  *4  4 

WW*  -I  3-  #  CH  °IMI 
XH  °l=  of  4  if  4  O}: 
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IA.  Add  a  particle  as  a  suffix  to  a  verb  stem,  adverb,  or  a  simple  inflection  of  the  verb  stem  or  adverb. 

sarang  pannun  anae  4  4  4  b  4  LH 

chago  moktoni  4  jl  4  4  Ll 

kanan  ul  iginun  pop  4  4  #  °l  7I  b  Vi 

IB.  Write  coordinated  or  multisyllabic  particles  together  as  a  word.  However,  always  separate  the 
particle  °|  (ui)  from  other  particles. 

na  mando  anida  4  4  -£  °f  4  4 

Han’guk  kojon  e  ui  ch‘odae  §  "S’  ^  °ll  —I  JS 

iltung  egenun  so  rul  sang  uro  ^  Ml  4|  b  A  %  4  —  -S. 

hangmun  uroso  ui  sahak  ^  H  —  -5=-  4  -I 

IC.  Write  independent  contracted  particles  (such  as  4! ,  *4  )  separately  from  the  preceding  word. 

yejon  en  mich‘o  mollassoyo  Ml  4  ^  4  4  -H  4  4  -2. 

kohyang  el  kanda  2  cl  ^  4  4 


ID.  When  particles  are  contracted  to  nouns  or  pronouns,  connect  them  to  those  words, 
chigum  un  nugun’ga  waso  xl  b  b  b  b  4  4  4 

nan  molla  4  !r  4 


kugon  na  to  molla 


<-1 


Write  a  simple  inflected  verb,  adjective,  or  adverb  as  a  separate  word  or  as  a  suffix  joined  to  a 
word,  according  to  the  dictionary  that  serves  as  authority  or  the  sense  of  the  element(s)  involved, 
mondong  i  t‘ul  ttae 
pam  e  ssun  insaengnon 
choguk  ul  chik‘in  yongsa 
nugu  rul  wihayo  chong  un  ullina 
simni  to  mot  kaso  palpyong  nanda 
irobSrin  kaul 
mongmariin  hujo 


4  t- 

°l 

M. 

“ti 

4 

Ml 

it  A4  iw 

±  4 

o 

EL 

4  y\ 

b  4 

i—  -1 

T  T 

3 

3 

4  °i  4 

m 

o 

b  4  4 

+  1 

b 

4  4 

5  4 

4  4 

4  4  4  4 

4  ir 

b  4 

e_ 

1 _ 

2A.  Separate  an  auxiliary  verb,  adjective,  or  adverb  and  its  inflection  from  the  word  stem, 
kago  sip'un  nae  kohyang  71  -3-  4  b  LH  -2-  4 

tt5na  on  ku  chari  e  4  *4  b  —  4  4  Ml 


nae  chan  i  nomch‘i  naida 


Ml 


•I  y  *1  M-  °l  4 


2B.  Separate  the  auxiliary  *1  £-\(hada),  the  copula®  1 4  (ida),  and  inflections  of  the  same  from  the  word 
stem  when  they  consist  of  two  or  more  syllables. 

sin  un  ch'angjoja  ida  b  iS  #  °l  4 

sarang  iraniin  pyong  4  4  4  4b 


2C.  Separate  a  gerund  form  from  the  word  stem. 

p‘urun  pyol  un  sara  issotta  b  b 


g  b  4  4  °JS14 
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3.  Write  an  imperfect  noun  as  a  separate  word,  except  as  noted  in  A-B  below. 


morankkot  p‘il  muryop 
ssirum  un  tano  ppun  anirao 

3A.  Write  a  single  syllable,  imperfect  noun  as 
inflected  verb. 

halsu  omnun  saramdul 
sinsa  sungnyb  yorobun 
nugu  rul  wihan  kosin’ga 
i  choguk  5diro  kalkot  in’ga 


U  ¥  ^ 

*l  f-  -e-  &  2  “  °[  »-i  A  2 

word  joined  to  an  attributive  adjective  or  to  a  simple 

^  ¥  Si  ¥  A  W 
&  ±  el  £ 

¥  9-  #  *1  ^ 

o|  <H  c|  s.  2  3  °J  n 


3B.  Connect  a  prefix  (  ££  si  §#  choptusa ),  such  as  the  native  Korean  prefixes  A  (/atf),  -g-  (hoi),  A*- 

(hat),  and  ¥  (p‘ut )  to  the  words  that  follow  them. 

kassumul  ^  ? 

totchogori  ^  El 

“  x)-  et 

p  ussarang  ^  1 


3C.  Write  a  single  syllable  attributive  adjective  or  prefix  as  joined  to  a  personal  pronoun  or  imperfect 
noun. 

kubun  i  sonsaeng  ida  -  ^  °l  9t  >%L  °l 

igot  i  chinsang  ida  °l  3  °l  (K  ffl  °l  cf 


4.  Write  a  derived  word  formed  by  the  addition  of  a  single  character  modifier,  affix,  or  substantive  as 
a  single  word,  whether  the  word  be  of  pure  Korean  or  Chinese  origin. 

Han’guk  kwa  Han’gugin 


pudongsan  p‘yongka 
taedosi  ui  inyom  kwa  Han’guk 
Hanbando  ui  p‘yonghwa  wa  anbo 
kyongje  ch‘odaeguk  ui  hyongsong 
hwangmuji  ka  changmikkot  kach‘i 


w  ns 

A  II  A 

PF  If 

XU  rfJ 

2-1  g  & 

A 

H  ¥  S 

A  ¥  fn 

A 

&x  m  -i 

$ 

cf  ¥  xl 

A  A  Dl 

21L 

2 

4A.  In  a  Sino-Korean  phrase,  write  a  simple  inflection  of  the  auxiliary  °\c\(hada)  and£-|c| (toeda),  and 
the  copula o |  c [(ida)  joined  to  the  word  stem, 
kodokhal  ttae  sayon  ul 
yongwonhan  saengmyong 
haengbokhan  Miguk  saenghwal 
chomyonghae  pon  uri  choguk 


mm  a 

t\<  ig  t+ 
¥  & 
aa  m  *n 


°i  ¥ 

JsL 


A  £  # 

¥  ei  nm 


4B.  Connect  a  single  character  modifier,  affix,  or  substantive  of  Western  origin  with  a  hyphen  to  a  word 
of  Korean  or  Sino-Korean  origin.  Connect  a  single  character  modifier  of  Korean  or  Sino-Korean 
origin  with  a  hyphen  to  a  word  of  Western  origin. 

Pak  Mog-wol  taep‘yo  esei-chip  ft  °ll  xll  °l  W: 

kukche  maak‘et‘ing-non  HI  Df  °f  *11  1)  ini 
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4C.  Write  an  attributive  adjective  or  a  pre-formative  character  separately  from  the  word  it  modifies. 


sin  hSnpop 

Iff  M  ft 

cho  hanul  edo  sulp‘um  i 

ft  ft  ft  ft  £  ft  ft  °l 

Han’guk  sinhak  nonmun  ch‘ong  saegin 

mm  n m  m x  & 

kundaesa  ui  chae  chomyong 

s  it  *  ft  n  mm 

kwanhan  il  yon’gu 

ft  ft  ft  ft  ft 

4D.  Write  two  coordinated  characters,  affixes,  or  substantives  together  as  an  integral  part  of  the  word. 
such‘urip  cholch‘a 
kungnaeoe  sajong 
Han’guk  sohwaga  inbo 

ch‘ongsonyon  ege  turinun  Hanoi  ft  ^  ft  °1l  A\  —  al  ft  ft  ft 


m  m  x 

HP  ft 

u  ft 

¥  It 

m  m 

m  m  % 

zvm 

ft  A  ft 

°ii  ft 

—  ei  ft 

4E.  Write  a  single  character  suffix  together  with  the  word  stem. 

kongsanch‘uk  ui  chujang  ft  H  HI  -I  i  ^ 

arumdaun  tongmuldul  ft  ■§-  ft  ft  ft  #  ft 


4F.  Add  the  binary^  ( chuui ;  used  chiefly  as  a  formative  element)  to  its  modifier  as  a  suffix, 
minjujuui  R  i  i  8 

Maruk‘usu-chuui  wa  Kidokkyo  ft  —  —  E-  i  S  ft 


5.  Write  any  binary  component  of  a  compound  as  a  single  word,  when  possible. 
t‘oji  kaeryang  chohap  ±  ttfe  ^ 


5A.  Write  a  single  character  substantive  as  part  of  the  preceding  binary  element  of  a  compound.  Write 
a  single  character  substantive  or  an  additional  modifier  together  as  part  of  the  binary  element. 


kukse  kibonpop 

mu 

5  *  y£ 

chijong  t‘onggye  chosap‘yo 

m  $ 

mst 

mm 

ft 

Han’guk  chont'ong  mokcho  konmulto 

m  b 

mm. 

mvi® 

kyeryang  kyongjehakchok  yon’gu 

nm 

g  »  m 

m 

5B. 


If  two  single  character  substantives  appear  in 
kukse  kibonpbp  non 
chijong  t'onggye  chosap‘yo  chip 
hyondae  chakka  samsipsamin  chip 
Chungguk  kojon  Hansiin  son 
kwijokche  sol  kwa  kwallyoje  non 


succession,  write  the  second  one  as  a 


mu 

$  ^  v± 

bW 

m  s 

VL  ft 

mm  ft 

m 

n  it 

wm 

EtE 

x  m 

*  m 

£  * 

mn  x 

m 

a  m 

U  & 

ft  w 

mu 

separate  word. 


m 


5C.  Write  a  single  character  noun  as  a  separate  word. 

hyon  haengjong  ha  e  che  kisul  M  ft  R  T  °ll 

Moktan’gang  haeng  y51ch‘a  15  ft  VX  It  5[J  ^ 

Han’guk  Chunggong  kan  ui  kyoyok  ^  ^  ft  50  -I 


ft  ft  m 
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5D.  Borrowed.  ( Western )  words  or  terms:  When  it  can  be  determined  that  a  word  or  words  in  Korean 
consisted  of  more  than  one  word  in  the  original  language,  apply  the  following  guidelines. 


If  they  appear  separately,  write  them  separately. 

syeip'u  op  a||  o|  z.  tl 

e||  o|  ^  aH  Lr| 


t'eibul  maeno 


If  they  appear  without  spaces,  write  them  as  a  single  word.  (It  is  useful  to  also  provide  an  access 
point  in  which  the  word  or  words  are  separated  in  the  same  manner  as  in  the  original  language.) 
syeip'uop  (originally  shape  up)  -Ml  °|  .5. 

(and  another  access  point  from  syeip'u  op) 
t'eibulmaeno  (originally  table  manners)  N|  °l  -g-  °H  M 

(and  another  access  point  from  t'eibul  maeno) 

If  they  appear  with  a  center  dot  (  •  ),  write  them  separately, 
syeip'u  op  -HI  °l  —  •  tj 

t'eibul  maeno  Ej|  o|  -g-  .  u-j 

6.  Personal  Names 

6A.  Write  a  family  name  consisting  of  two  characters  as  a  single  word.  Hyphenate  a  given  name  in  two 
characters  or  a  courtesy  name  (in  place  of  a  given  name),  and  capitalize  only  the  first  letter  of  the 
first  syllable. 


Ch‘oe  Ch‘i-won 

m  Sit 

Yi  Kwang-su 

°l  #  4- 

Kim  So-wol 

£ 

m  a 

Namgung  Kak 

EE 

Sonu  Chong-won 

6B.  Write  a  pseudonym  or  other  assumed  name  as  one  word 
Kim  Sakkat 
Ch'ungmugong 


# 

&  it 


6C.  Write  a  Buddhist  priestly  or  posthumous  name  as  one  word. 
Iryon  — 

Sok  Myongjong  P  IE 


6D.  Write  a  reign  title,  temple  name,  or  title  of  nobility  as  one  word  without  a  hyphen. 
Taejo  £  ftl 

Yi  Sejong  ^  "K  ^ 

Kwanggaet'o  Wang  ^  1^  i  it 

Hyegyonggung  Hong  Ssi  ®  ®  Ft 

Chang  Huibin  ^  MM 

Hyonu  Haengja  ^  ®  ^  ^ 
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7.  Corporate  Names,  Geographical  Names,  Names  in  Publication  Titles,  etc. 


7A.  Treat  a  corporate  name  also  as  a  binary  compound  when  possible.  Write  separately  as  binary 
elements  general  terms  such  as*f*  §( Hakhoe ),^  (Hakkwa),  Q  #(. Kyohoe ),  etc. 


Han’guksa  Hakhoe 

m  m  £ 

(. Society  of  Korean  History ) 

Han’guk  Sahakhoe 

m  m 

£*£ 

w 

(World  History  Society  of  Korea) 

Nasong  Hanin  Changno  Kyohoe 

ol 

Kugo  Kungmun  Hakkwa 

mm 

mx 

Nodong  Kijun  Chosaguk 

%  m 

mm 

Han’guk  Hyum5nisut‘u-hoe 

U  B 

L-l 

#  3: 

mn 

Hi 


JL  *| 


7B.  Write  a  proper  name,  term  of  address,  or  publication  title  separately  from  its  modifier  and  also 
separately  from  the  word  it  modifies. 


Wollam  Yi  Sang-jae  Ong 
Nanjung  ilgi  ch‘o 
Hungs  on  Kun 
Kwibong  chip 
Wonye  Hakkwa  chi 
Taegu  Maeil  Sinmunsa  sa 


mu 

b  le 

m  § 

m 

mm 

m 

^*4 

Mr 

±  m 

m  b 

»T 

ITS 


$t  m  £ 


7C.  Hyphenate  a  generic  term  used  as  part  of  the  name  of  a  jurisdiction,  and  indicate  phonetic  changes, 
except  in  the  case  of  the  termJhiM(pw/rfo). 

Kangwon-do  VX  IE  iM 

Kyongsang-pukto  db  ifi 

Taegu-si  X.  ®  rfi 

Kahoe-dong  ^  ^  ^ 


7D.  Write  a  generic  term  for  a  type  of  topographic  feature,  architectural  construction,  or  a  corporate 
entity  used  as  a  part  of  a  proper  name,  together  with  its  name. 

Hallasan  M  ^  ill 

Naktonggang  ^  & 

Tongnimmun  3S  XL  H 

Tonga  Ilbosa  ^  £5  B  ^ 

YurisSng 


o 

TT 


2| 


7E.  Treat  a  generic  term  for  a  topographical  feature  or  a  jurisdiction  also  as  a  binary  element  when 
combined  with  another  word. 

T‘aebaek  sanmaek 
Anju  Kunminhoe 
Wonju  kunji 

Sojangdae  Manwol  Sanjong 


X  £ 

ill 

£  tt\ 

iRI 

IE  tt\ 

m  a± 

£  mn  dj  m 
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8.  Abbreviated  Forms: 


8A.  Write  contractions  which  include  proper  names  as  a  single  word. 

Yongamsaji  tF  tit  <  §1  ^  +  tit  ^ 

Soul  T‘ukpyolsirip  Namsan  Tosogwan  %  $3  B*J  rfr  ft  ^  ill  @ 

<  A  %  glj  rfi  +  rfJ  ft  ..A 

8B.  Write  an  abbreviated  or  contracted  proper  name,  Korean  or  foreign,  as  a  single  word, 
onul  ui  Pukhan  -2-  ilr  -I  ft 

Chon‘gyongnyon  ^  ^ 

Chunggongkwon  ui  changnae  ^  -  I  -I  It?  ^ 

8C.  Hyphenate  abbreviated  forms  combined  coordinately.  Do  not  indicate  phonetical  changes.  Write 
a  single  character  substantive  as  part  of  the  final  element  in  that  combination. 
ch‘oesin  Pul-Han  sajon 
Soul  Chung-Kodung  Hakkyo 
sinyok  Sin-Kuyak  chonso 
chung-tanp‘yon  sosol 
Paekhwabon  Tang-Song  sanmunson 
Myong-Ch‘ong  p'ilgi  kosa  sonyok 


mm 

mm 

a  # 

ft®# 

m  ib  « 

±m 

ft  5s  IS 

ft 

ft  is  # 

mxm 

iJ  A 

jl  ft  *! 

9.  Numerals: 

9A.  In  romanizing  numbers  and  adjacent  words,  create  binaries  whenever  possible.  Write  the  pre- 
formative  element  sfc  ( che )  joined  to  the  following  number  to  form  a  binary. 
che-1  m  1 

cheil  — 

che-3  segye  3  C  #  ^ 

chesam  segye  M  —  t5  I? 

However,  prefer  to  join  a  number  to  a  suffix  or  generic  term  that  follows  it  with  a  hyphen  (for 
example; tX(ch‘a), @ (hoe),  ^(pon),^F-(yon)).  Do  not  indicate  phonetical  changes  after  the  hyphen. 
In  such  cases,  when  (che)  precedes  the  number,  separate  it  from  the  number, 
che  1-ch‘a 

che  ilch‘a  ^ 

che  3-chip 

In  more  complex  situations,  still  attempt  to  create  binaries  when  possible, 
cheil,  i  Konghwaguk  —  ft  li 

che-1,  2  Konghwaguk  H  1  •  2  ft  fD  t§3 


9B.  Write  a  spelled-out  cardinal  number  as  one  word.  In  a  number  over  one  hundred,  separate  by 
hyphens,  without  phonetic  changes,  each  unit  of  ten  (  -f-  sip),  hundred  (  H  pack),  thousand  (ft 
ch‘on),  etc. 

ilch‘on-kubaek-yuksipp‘allyon  ft,  ✓n  A 
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9C.  Treat  a  contracted  form  of  a  cardinal  number  or  calendar  year  as  one  word. 

yuksibinyon  saop  kyehoek  ^  ~  ^  ^  1+  I1] 

62-yon  saop  kyehoek  f  i  Irf"  IfJ 

tosi  ch‘ilsibil  ®  ^  ±  +  ~ 

tosi  71  SP  rf?  71 


9D.  In  case  of  certain  contracted  numerals  that  have  been  firmly  established  through  common  usage, 
prefer  that  form. 

sa-ilgu  haksaeng  uigo  29  '  h  f§  ^ 

o-illyuk  kunsa  hyongmyong  5  •  ✓v  1?  ^  ap 

samil  undong  —  31 

9E.  In  writing  native  Korean  numbers,  also  attempt  to  create  binaries  when  possible. 


se  pontchae 
ch‘aek  nekwon 
som  han’gun 
so  yol  mari 


-Ml  £  *11 
Ml  & 

fcf-  .H 


g  □(-  a| 


Capitalization 

1.  Each  separately  written  word  of  a  corporate  name  (except  particles),  or  an  abbreviation  thereof, 
is  capitalized. 

Han’guk  Ilbosa  ^  ®  B  ^ 

Taehan  Sanggong  Hoeuiso  ^  ^  ®  i  W  M  FJf 

2.  Each  separate  word  of  a  personal  name  is  capitalized. 

Yi  Kwang-su  ^  ^ 

Chong  Yon-hui  ® 


Titles  and  terms  of  address  are  capitalized. 
Pak  Taet‘ongnyong 
Cho  Yong-gi  Moksa 


ft 

5-  %  *  4 


Each  separately  written  word  of  a  geographic  name  is  capitalized.  An  abbreviated  geographical 
name  is  capitalized.  An  abbreviated  geographical  name  is  capitalized  in  coordinate  compounds 
and  at  the  beginning  of  other  compound  words.. 

Soul  T‘ukpyolsi 
Kyongbuk  Sup‘il  Tonginhoe 


4 


3  4 


*  •£  Al 

^  i! 


oi  *| 


The  first  word  of  the  title  of  a  book,  periodical,  or  series  is  capitalized. 
Hyondae  kukchepop  ^  it  ®  P* 

Silch‘on  munhak  ui  sijip  i!  ^  ^  ^  -I  ^ 
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6.  Names  of  dynasties  are  capitalized. 
Yijo  obaengnyon 
Myong-Ch‘ong  sidae 


29E  w  51$ 

w  ft  m  it 


7.  A  word  derived  from  a  proper  name  is  capitalized  only  if  the  name  retains  its  full,  original  meaning. 

Yangmyonghak  yon’gu  Bf] 

Hyondae  wa  K‘urisuch‘yan  ui  sinang  CH  £1  —  al  —  ^  tJ  cl" 

8.  Abbreviated  forms  combined  coordinately  are  capitalized  if  called  for  by  these  guidelines. 

Sin-Kuyak  Songso  St  IB  § 

Siryong  Pul-Han  sajon  el  -§■  "S'  ^  A(" 

9.  In  all  other  cases,  follow  the  directions  found  in  the  officially  designated  style  manual. 


Punctuation 

1.  A  centered  point  (  •  )  indicating  coordinate  words  is  generally  transcribed  as  a  comma,  except 
where  the  rules  require  a  hyphen. 

Chung-Kodung  Hakkyo  ^  •  rS  5#  ^  12 

Nam-Pukhan  kyongje  hyomnybk  T) 


In  other  instances,  it  may  be  transcribed  or 
sa-ilgu  ui  minjungsa 
Chang  Tae-uk  ch5 
Iryop,  Iltang  sihwajip 


not  transcribed,  depending  on  the  context. 

0  •  —  A  *1 
ASP  ■  # 

-  m  •  B  £  mi 


2.  Brackets  (  r  ...  J  )  used  in  the  manner  of  quotation  marks  (  "  ...  "  )  are  transcribed  as  the  latter. 
"Munhak  kwa  chisong"  siinson  £d  ££  J 

"Si  wa  haebang"  tongin  sijip  r  ^  *1  Si  J  -i-  Al 


Dictionaries  to  be  Used  as  Authorities  in  Korean 

To  determine  standard  contemporary  pronunciation  in  South  Korea  and  North  Korea: 

Nam,  Kwang-u.  Han’gugo  p ‘yojun  parum  sajon.  (Kyonggi-do  Songnam-si:  Han’guk  Chongsin 
Munhwa  Yon’guwon,  1984) 

m  .  m mm  & %  » m . 

Hyondae  Chosonmal  sajon.  Che2-p‘an.  ([Pyongyang]:  Kwahak,  Paekkwa,  Sajon  Ch‘ulp‘ansa,  1981) 
^  i  d  #  4  2  . 
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As  the  basis  for  word  division  decisions  for  contemporary  publications  from  South  Korea  and  North 
Korea: 

Sinpfon  kugo  taesajon.  (Soul  T‘ukpyolsi:  Taeyong  Ch‘ulp‘ansa,  1976) 

il  £  q-  <H  4  2  . 

Hyondae  Chosonmal  sajon.  (1981) 

£  <=H  '  &  if  4  &  . 

Supplementary  source  for  word  division  decisions  for  classical  Korean  publications: 

Ko  popchon  yongojip .  ([Seoul]:  Popchech‘6,  1979) 

£  vi *  mmn . 

To  determine  reading  and  pronunciation  of  Chinese  characters: 

Chang,  Sam-sik.  Tae  Han-Han  sajon.  (Soul  T‘ukpyolsi:  Songmunsa,  1964) 

5i  Ei.  x  mm  »  * . 

Sin  chajon.  ([Seoul]:  Sinmun’gwan,  1915;  Reprint:  [Seoul]:  Cho  Yong-sung,  1973) 

SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

5 

alif 

AE 

C 

ayn 

BO 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

r: 

breve 

E6 

ALA-LC  Romanization  Tables 


page  111 


Korean 


Flowchart  for  Applying  Korean  Word  Division  and  Romanization 


Word  Division 
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Romanization 
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Kurdish 

(in  Perso-Arabic  Script) 


Letters  of  the  Alphabet 

Consonants 

Initial 

Medial 

Final 

Alone 

Romanization 

9 

9 

’  (alif) 

mi 

SmmJ 

b 

mi 

v 

SmrnJ 

P 

J 

G. 

CLi 

t 

3? 

e- 

G 

j 

2* 

S*m 

G 

ch 

c- 

c 

h 

£%m 

t- 

C 

kh 

d 

V 

V 

-/ - 

V 

r 

J 

Jr 

Jr 

J 

z 

J 

Jr 

Jr 

J 

zh 

-UJ 

wJjJm 

u>**- 

s 

Jjj 

JjJm 

sh 

u*3 

s 

• 

X 

JZ. 

e- 

t 

‘  (ayn) 

X 

JEL 

e- 

t 

gh 

J 

Jl 

f 

JL 

<JL 

<J£ 

V 

iL 

vi- 

vi 

q 

iL 

k 

3 

g 

J 

1 

J- 

J 

l 

JQ 

JO. 

r- 

P 

m 

mi 

-4- 

j>- 

o 

n 

3 

J- 

3- 

3 

w 

a> 

•e 

<\ 

a 

h 

mi 

cr* 

cS 

y 

Vowels 

Initial 

Medial 

Final 

Alone 

Romanization 

•e 

o 

a 

1 

L 

L 

1 

a 

3 

>- 

> 

u 

33 

u 

- 

- 

- 

- 

i 

-4-4 

-4-4. 

CTT< 

T* 

1 
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RULES  OF  APPLICATION 

1.  When  v5  represents  the  idafah ,  romanize  it  as  -i  following  a  consonant  and  as  -y  following 
a  vowel.  Attach  this  romanization  to  that  of  the  preceding  word. 

2.  Romanize  the  conjunction  j  as  wa. 

3.  Romanize  the  particles  ^  and  as  ba-  and  la-.  Attach  this  romanization  to  that  of  the 
following  word. 

4.  Romanize  the  zher  (unwritten  vowel)  as  i. 

5.  Use  the  single  prime  (')  in  the  following  situations: 

(a)  To  separate  two  letters  representing  two  distinct  consonantal  sounds,  when  the 
combination  might  otherwise  be  read  as  a  digraph; 

(b)  To  mark  the  use  of  a  letter  in  its  final  form  when  it  occurs  in  the  middle  of  a  word. 

6.  For  the  romanization  of  Kurdish  dialectical  variants  bearing  diacritics  (e.g.,  T,  ),  use  the 
romanization  of  the  letter  without  the  diacritic. 

7.  Romanize  foreign  words  occurring  in  a  Kurdish  context  and  written  in  Kurdish  letters 
according  to  the  present  tables  and  rules. 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character  Name 


USMARC  hexadecimal  code 


/ 


< 


soft  sign  (prime) 

A7 

alif 

AE 

ayn 

BO 

Character  modifiers  Name 


USMARC  hexadecimal  code 


□  macron 

□  dot  below 


E5 

F2 
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* 

Consonants  (See  Note  1) 

Vernacular  Romanization  When  Initial  Romanization  When  Final 


n 

k 

k 

2 

kh 

GO 

ft 

kh 

00 

V 

ng 

ng 

$ 

ch 

(t) 

fl 

s 

(t) 

? 

s 

(t) 

u 

ny 

(See  Note  2) 

- 

o 

d 

t 

o 

t 

(t) 

th 

(t) 

m 

th 

(t) 

u 

n 

n 

u 

b 

P 

0 

P 

(P) 

C3 

ph 

(P) 

ei 

f 

(P) 

UJ 

ph 

(P) 

ui 

f 

(P) 

XJ 

m 

m 

d 

y 

- 

s 

r 

(n) 

1 

(See  Note  3) 

(n) 

0 

V 

(See  Note  4) 

- 

m 

h 

(See  Note  5) 

- 

3 

*  (ayn)  (See  Note  6) 

- 

h 

_ 

Obsolete  Consonants 


Vernacular 


Romanization 
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Vowels  (See  Note  7) 


nacular 

Romanization 

Vernacular 

Romani 

□  S,  □ 

a 

CD?Z,  tf 3 

9 

□  2 

a 

fi,  D9 

9 

o 

o  o 

□ 

i 

cat,  c □ 

oe 

rv 

□ 

T* 

1 

rv 

CD 

ce 

F) 

□ 

ir 

fqjs,  qj 

ia 

rv 

□ 

\x 

rqj,  qj 

la 

q 

u 

fDSS,  fD9 

ira 

u 

fQ9 

Ira 

for,  cd 

e 

□  OS,  DO 

ua 

CD 

e 

□  0,  DO 

ua 

CCDS,  CCD 

as 

tD,  *!□,  Do 

ai 

CCD 

as 

□20,  02 j 

ai 

Ids,  □ 

o 

f02 

ao 

\a 

5 

& 

am 

Special  Symbols 


Vernacular 


Romanization 


7 

!) 

7a7 


(syllable  repeated) 
la 


Notes 

1.  Some  consonants  do  not  occur  as  finals,  except  in  loan-words.  The  romanization  for  these 
consonants  is  enclosed  in  parentheses  in  the  table. 

2.  In  final  position,  o  follows  as  vowel,  closing  the  diphthong,  and  is  romanized  as  i.  See  the  vowel 
table  ( ai  and  ai).  In  non-initial  position,  o  is  often  written  in  a  secondary  form,  j  .  See  the  vowel 
table  ( ia  and  la). 


ALA-LC  Romanization  Tables 


page  117 


Lao 


3.  The  combining-form  of  a  is  ^  .  It  is  not  romanized  except  when  subscript  to  m  ,  thus:  »  aciju 
( khitsatTan ),  vvts  ( lai ). 

4.  In  initial  position,  o  is  romanized  as  v,  thus:  oo  (vat). 

When  medial  3  is  followed  by  a  vowel,  it  forms  a  cluster  with  the  initial  consonant,  and  is 
romanized  as  w,  thus:  232  ( khwa ),  cc%y)  (khwaeng). 

When  medial  3  is  followed  by  a  final  consonant,  it  is  romanized  ua,  thus:  U3 n  (buak),  030  (dual). 

When  3  is  in  final  position  following  a  vowel,  it  is  romanized  as  o,  thus:  tna  (nao),  cc  r>3  (kcEo).  See 
also  the  vowel  table  (ua  and  ua).  The  forms  §3  and  Sa  are  romanized  as  iu  and  Tu,  and  the  forms 
□Ja  and  ftp  are  romanized  as  Teo. 

5.  in  is  not  romanized  before  u  (ny),  v  ( n ),  is  (m),  s  (r),  a  (/),  and  3  (v),  thus:  in tn  (nya).  inn  and 
inu  are  often  written  as  ligatures,  thus:  wj  (n)  and  on  (m). 

6.  3  is  used  in  initial  position  for  syllables  beginning  with  a  vowel,  thus  sn  (fTk).  When  not  in  initial 
position,  3  is  a  vowel.  See  the  vowel  table  (p,  p,  ua,  ua). 

7.  In  the  vowel  columns,  □  shows  the  position  of  the  consonant  relative  to  the  vowel.  In  all  cases 
the  romanization  of  the  vowel  is  intended  to  follow  the  romanization  of  the  consonant  even  though 
parts  of  the  vernacular  representation  of  the  vowel  may  be  written  before  the  consonant. 

8.  The  numerals  are:  o  (0),  9  (1),  (,  (2),  fi  (3),  4  (4),  &  (5),  (5  (6),  n  (7),  2  (8),  and  w  (9). 

9.  In  Lao,  words  are  not  written  separately.  In  romanization,  however,  the  shortest  written  form 
which  can  stand  alone  as  a  word  is  treated  as  such.  This  applies  also  to  Pali  and  Sanskrit  loan¬ 
words.  Other  loan-words  have  the  same  division  as  the  original  language.  Tonal  marks  (□,  □,  □, 
5)  are  not  romanized. 

10.  When  the  pronunciation  requires  one  consonant  to  serve  a  double  function,  at  the  end  of  one 
syllable  and  at  the  beginning  of  the  next  syllable,  it  is  romanized  twice  according  to  the  respective 
values,  thus:  »  (khitsatTan). 
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SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

M 

digraph  AE  (upper  case) 

AS 

CE 

digraph  OE  (upper  case) 

A6 

1/ 

U-hook  (upper  case) 

AC 

< 

ayn 

BO 

as 

digraph  ae(lower  case) 

B5 

oe 

digraph  oe  (lower  case) 

B6 

u-hook  (lower  case) 

BD 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

□ 

macron 

E5 

5 

right  cedilla 

F8 
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Lepcha 


Vowels  and  Diphthongs  (see  Note  1) 


8 

a  (see  Note  2) 

83 

u 

S( 

a 

8 

*% 

e 

$8 

i 

(8 

o 

58 

T 

1 

( l 

o 

U 

8 

a 

Consonants  (see  Note  3) 

Gutturals 

Palatals 

Cerebrals 

Dentals 

^  ka 

■0 

ca 

^  ta 

*  ta 

^  kha 

D 

tsa 

V)  tha 

b  tha 

&  ga 

X 

cha 

da 

*  da 

>  ha 

c* 

tsha 

dha 

na 

V 

ja 

4 

za 

na 

Labials 

Semivowels 

Sibilants 

Aspirate 

^  pa 

£ 

ya 

G*  sa 

V  ha 

w  pha 

r 

ra 

6  sa 

fa 

N 

la 

A  ba 

A 

va 

^  ma 

wa 

Some  Consonant  Clusters 

5 

kla 

O 

bla 

gla 

mla 

n 

pla 

X 

hla 

fla 

Notes 


1.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed;  the  forms  used  for 
vowels  following  a  consonant  exclude  8  .  No  distinction  between  the  two  is  made  in 
romanization. 

ka  ki 

2.  8  is  also  used  as  a  glottal  stop  and  t  is  added  to  it  in  a  cluster.  It  is  romanized  ‘a. 

iv\>  ‘ayuk 
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3.  The  vowel  a  is  implicit  after  all  consonants  and  consonant  clusters  and  is  supplied  in 
romanization,  except  when  another  vowel  is  indicated  by  its  appropriate  sign. 


RULES  OF  APPLICATION 

1.  When  ft,  %,y  and  N  are  used  in  the  final  position  (i.e.,  preceded  by  a  vowel  but  not 
followed  by  one),  they  assume  different  shapes  and  are  used  as  superscripts  above  the  preceding 
syllables. 


kak 

£ 

kap 

kan 

£ 

kam 

kat 

£ 

kar 

kan 

£ 

kal 

2.  When  t  is  added  to  another  consonant  or  consonants  in  a  cluster  (i.e.,  without  any  intervening 
vowel),  it  assumes  the  shape  u  and  is  added  to  the  preceding  consonant  or  consonants. 

Cd  kya  mya  tj?  klya 

3.  When  Y  is  added  to  another  consonant  or  consonant  in  a  cluster  (i.e.,  without  any  intervening 

vowel),  it  assumes  the  shape  )  and  is  added  to  the  preceding  consonant  or  consonants. 

€,  kra  gra 

SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

< 

ayn 

BO 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

0 

tilde 

E4 

□ 

macron 

E5 

□ 

breve 

E6 

□ 

dot  above 

E7 

□ 

dot  below 

F2 
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Vowels  and  Diphthongs  (see  Note  1) 


a 

a 

a? 

ai 

1 

a 

a? 

o 

t 

i 

|r 

au 

3 

u 

r 

5 

a ; 

e 

e 

Consonants  (see  Note  2) 

Gutturals 

Palatals 

Dentals 

z 

ka 

5»l 

ca 

<3 

ta 

XL 

kha 

E 

cha 

tha 

V 

ga 

’SI 

ja 

S 

da 

4 

gha 

jha 

V 

dha 

<3* 

ha 

9c 

ha 

2 

na 

Labials 

Semivowels 

Sibilants 

Aspirate 

CO  pa 

a 

ya 

IT  sa 

C  ha 

oo  pha 

X 

ra 

oi  sa 

^  ba 

la 

bha 

*24 

va 

3  ma 

Notes 

1.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed;  the  forms  used  for 
vowels  following  a  consonant  exclude  a  .  No  distinction  between  the  two  is  made  in 
romanization. 

Z  ka  t  ki 

2.  The  vowel  a  is  implicit  after  all  consonants  and  consonant  clusters  and  is  supplied  in 
romanization,  with  the  following  exceptions: 

(a)  when  another  vowel  is  indicated  by  its  appropriate  sign;  and 

(b)  when  the  absence  of  any  vowel  is  indicated  by  the  subscript  sign  (  _  )  called  sa-i. 
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RULES  OF  APPLICATION 


1. 


-1  ( mukphreh )  is  used  to  indicate  a  slight  breathing  after  a  vowel.  It  is  romanized  h. 

Z&  kahyo  pihre 

•3,5  tahma  ^  muhla 


2.  —  ( kemphreh )  is  used  to  make  a  vowel  slightly  longer  than  usual.  It  is  romanized  ’. 


& 


yuma 


tha’be 


3.  When  Z,  *3%  <3,  2,  Q,  3,  X,  and  ^  are  used  in  the  final  position  (i.e.,  preceded  by  a  vowel  but  not 
followed  by  one),  they  assume  different  shapes  and  are  used  as  superscripts  above  the  preceding 
syllables. 


zr 

kak 

Z" 

kap 

z* 

kail 

Z* 

kam 

Z 

kat 

Z 

\ 

kar 

2 J 

kan 

2 v 

kal 

4.  When  5  is  added  to  another  consonant  or  consonants  in  a  cluster  (i.e.,  without  any  intervening 
vowel),  it  assumes  the  shape  £  and  is  added  to  the  preceding  consonant  or  consonants. 

Zr  kya 


5.  When  X  is  added  to  another  consonant  or  consonant  in  a  cluster  (i.e.,  without  any  intervening 
vowel),  it  assumes  the  shape  A  and  is  added  to  the  preceding  consonant  or  consonants. 

?  kra 

6.  When  V  is  added  to  another  consonant  or  consonants  in  a  cluster  (i.e.,  without  any  intervening 
vowel)  it  assumes  the  shape  ^  and  is  added  to  the  preceding  consonant  or  consonants. 

Ztu  kva 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

apostrophe 

27 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

tilde 

E4 

□ 

macron 

E5 

□ 

breve 

E6 

□ 

dot  above 

E7 

□ 

dot  below 

F2 
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Malay 

(in  Jawi-Arabic  Script) 

'f 


Letters  of  the  Alphabet 


Initial 

Medial 

Fsnal 

1 

L 

L 

-d 

V--*. 

J 

J 

3* 

£- 

2* 

r 

^s. 

c- 

6 

i 

X 

v 

-r 

V 

J 

Jr 

Jr 

-aU. 

Jjj 

jIl 

U"- 

-4*3 

-^X 

U**- 

-4^3 

L 

Jo. 

JL 

Jiw 

Jx 

X 

A 

e- 

X 

S3. 

e- 

£ 

A 

e- 

A 

Ji 

iL 

<3- 

Ja 

JL 

S- 

i 

viL 

J 

v»i- 

J- 

J 

l 

J- 

JO 

A. 

P* 

-j 

“L 

o~ 

mJ 

o~ 

JO) 

■e 

A  «  A 

3 

j- 

>■ 

-j 

<_r- 
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Alone 

Romanization 

1 

omit  (see  Note  1) 

SmJ 

b 

cltf 

t 

CL j 

*th,  s  (see  Notes  2) 

e 

j 

5 

c 

c 

h 

c 

kh 

d 

i 

*dh,  z  (see  Notes  2  and  3) 

r 

J 

z  (see  Note  3) 

O" 

s 

v>“ 

sy  (see  Note  3) 

*§,  s  (see  Notes  2  and  3) 

U*3 

*$,  d  (see  Notes  2  and  3) 

1= 

*(,  t  (see  Notes  2  and  3) 

Ji, 

*z,  1,  z  (see  Notes  3  and  4) 

6 

(omit;  see  Note  3) 

6 

gh  (see  Note  3) 

i 

ng 

c_fi 

*f,  p  (see  Note  3  and  5) 

o 

*q,  k  (see  Note  2) 

o 

P 

oj 

k 

g 

J 

1 

? 

m 

0 

n 

o 

ny 

o  <  b 

h 

w,  u,  o,  au  (see  Note  6) 

*3 

y,  i,  e,  ai  (see  Note  7) 
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Vowels  and  Diphthongs 

1  □  a 

3  0  u 

s?  □  i 

e 

e,  ai 

j  jo  o,  au 

Notes 

1.  For  the  use  of  alif  see  rules  3-5. 

2.  Letters  in  the  romanization  column  marked  with  an  asterisk  (*)  represent  the  romanized 
value  of  the  equivalent  Jawi  letter  when  it  occurs  in  Arabic  words  (not  Arabic  loan  words). 
The  letter  not  designated  with  an  asterisk  represents  the  proper  romanization  value  for  the 
letter  when  it  occurs  in  Malay  words.  The  boundary  between  words  that  are  Arabic  loans 
and  those  that  are  foreign  Arabic  terms  used  in  Malay  context  is  not  always  easy  to  draw. 
Common  usage  for  the  types  of  literature  in  which  such  words  appear  should  always  be 
followed. 

3.  Jawi  letters  typically  found  only  in  Arabic  and  Arabic  loan  words. 

4.  Some  words  of  Arabic  origin  with  the  letter  have  come  into  the  language  with  the 
equivalent  /  (e.g.,  lahir );  others  with  the  equivalent  z  (e.g.,  zalim). 

5.  The  letter  ^  is  often  used  as  a  shorthand  way  of  writing  »-*.  When  this  is  clearly  the  case, 
the  letter  ^  should  be  romanized  as  p. 

6.  On  3  (’ wau ),  see  rules  6  and  8b. 

7.  On  ^  (yad),  see  rules  7  and  8b. 

RULES  OF  APPLICATION 

1.  Jawi  orthography  is  far  from  standard,  particularly  in  handwritten  documents.  Nevertheless,  for 
the  purposes  of  cataloging,  it  is  essential  to  standardize  the  romanized  form  of  every  lexeme. 
Two  widely  accepted  standards  for  writing  Malay  in  the  Latin  script  exist,  the  Indonesian  and 
the  Malaysian.  In  this  table,  the  Indonesian  standard,  referred  to  as  Ejaan  Bahasa  Indonesia 
yang  Disempurnakan  has  been  employed. 

2.  Arabic  words  (not  Arabic  loan  words)  appearing  in  a  Malay  text  are  analogous  to  French  words 
and  expressions  in  a  Russian  novel  or  Latin  phrases  in  a  Catholic  theology  text,  and  their 
distinctness  should  be  preserved  by  transliterating  them  in  accordance  with  the  rules  goberning 
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the  romanization  of  the  relevant  language,  Arabic  or  Malay.  Malay  words  of  Arabic  origin 
whose  orthography  is  the  same  as  the  Arabic  might  therefore  be  romanized  differently  at  different 
points  in  the  same  text.  Thus  the  word  will  be  romanized  kudus  when  it  appears  as  a 

Malay  word,  but  qudus  when  it  is  used  as  an  Arabic  term  or  in  an  Arabic  phrase. 

3.  I  ( alif ),  3  ( wau ),  and  (yad)  are  used  to  support  *  (hamzah);  when  so  used  these  letters  are 
not  represented  in  romanization.  See  rule  8b. 

4.  At  the  beginning  of  a  word,  alif  represents  an  initial  vowel  or  diphthong  and  is  romanized 

accordingly  as  a,  e,  i,  u,  o  ( _?• ),  au,  (  ),  e,  ( —?} ),  or  ai  ( — ).  Following  a  consonant,  alif 

represents  the  vowel  romanized  as  a. 

5.  The  optional  alif  gantung  (for  example,  in  the  word  (for  example,  in  the  word  & ),  when  used, 
does  not  change  the  romanized  form  of  the  word. 

6.  The  letter  j  (wau)  is  used:  (a)  to  represent  the  consonant  romanized  as  w;  (b)  to  represent  the 
vowels  and  diphthong  romanized  as  u,  o,  and  au;  and  (c)  to  support  ?  (hamzah)  (see  rule  8b). 

7.  The  letter  ^  (yad)  is  used:  (a)  to  represent  the  consonant  romanized  as  y;  (b)  to  represent  the 
vowels  and  diphthong  romanized  as  i,  e  and  ai;  and  (c)  to  support  ?  (hamzah)  (see  rule  8b). 

8.  9  (hamzah) 

(a)  In  Arabic  words,  and  most  Arabic  loans  where  it  is  found,  hamzah  is  romanized  according 
to  the  rules  for  Arabic  (including  use  of  the  non-alphabetic  mark  ’  (alif). 

(b)  Hamzah  used  to  separate  contiguous  vowels,  supported  by  •,  or  3  or  ,  is  not  represented 
in  romanization  (for  example,  lain  ,  laut  ^3^  , permintaan  J  ^ ). 

(c)  Hamzah  replacing  initial  alif  in  vowel-initial  words  to  which  the  prefix  Ice-  or  se-  has  been 
appended  is  represented  by  the  vowel  value  that  it  elided  (keenam  ,  seindah  , 
seorang  ^3^)* 

(d)  A  hamzah  used  after  the  reduplicating  numeral  2  in  vowel-final  roots  followed  by  the 
suffixes  -an  or  -i  is  not  represented  in  romanization. 

(e)  The  occasional  use  of  hamzah  to  represent  a  word-final  glottal  stop  (for  example,  *  ) 

is  romanized  as  k. 

?.  Words  doubled  with  the  number  2  should  be  written  out  in  full  (for  example:  mata-mata  Y^Lo  ). 
When  the  root  word  ends  in  a  consonant  such  that  an  appended  -an  or  -i  suffix  must  reduplicate 
the  final  consonant,  the  doubled  consonant  is  not  represented  in  the  romanization  (for  example: 
rumput-rumputan  ). 
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10.  In  cases  where  the  postpositive  itu  or  prepositive  yang  are  joined  to  the  preceding  or  following 
word,  they  should  be  romanized  as  separate  words. 

11.  Alif  maqsur ah,  waslah,  maddah,  shaddah,  and  tanwm  are  not  commonly  used  in  Jawi  script 
except  in  Arabic  words  or  phrases.  If  they  do  not  appear  as  adornment  on  Malay  words,  they 
are  not  represented  in  the  romanization.  Ta  marbutah,  romanized  as  t,  is  not  properly  a  Jawi 
script  character  either,  but  sometimes  it  may  be  found  in  Malay  texts. 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

alif  (hamzah) 

AE 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

□ 

breve 

E6 

□ 

• 

dot  below 

F2 
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Malayalam 


Vowels  and  Diphthongs  (see  Note  1) 


cm 

a 

U 

r 

o 

(315) 

a 

to 

1 

o 

V 

a  (see  Note  2) 

e 

& 

i 

T- 

e 

gDD 

1 

61(1^) 

ai 

£ 

U 

61 

o 

u 

610 

5 

S3 

r 

o 

61D 

au 

Consonants  (see  Note  3) 

Gutturals  Palatals  Cerebrals  Dentals 


<a> 

ka 

nj 

ca 

s 

ta 

(0) 

ta 

60 

kha 

oO 

cha 

o 

tha 

LO 

tha 

CO 

ga 

S 

ja 

(U) 

da 

8 

da 

op  | 

gha 

rovo 

jha 

cue 

dha 

(JO 

dha 

613 

ha 

610) 

ha 

6TT) 

na 

CD 

na 

Labials 

Semivowels 

Sibilants 

Aspirate 

nJ 

Pa 

QQJ 

ya 

(JO 

sa 

on  ha 

rvO 

pha 

CO 

ra 

no 

sa 

601 

ba 

O 

ra 

oru 

sa 

e 

bha 

oo 

tta 

(see  Note  4) 

it 

ma 

ej 

la 

v 

OJ 


la 

la 

va 


Anusvara 
(see  Note  5) 


Visarga 


Avagraha 


m 


(apostrophe) 
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Notes 

1.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed;  the  forms  used  for 
vowels  following  a  consonant  can  be  found  in  grammars;  no  distinction  between  the  two  is 
made  in  transliteration. 

2.  When  o  is  used  in  combination  with  the  vowel  u  (§),  the  combination  is  also  transliterated 
by  a. 

3.  The  vowel  a  is  implicit  after  all  consonants  and  consonant  clusters  and  is  supplied  in 
transliteration,  with  the  following  exceptions: 

(a)  when  another  vowel  is  indicated  by  its  appropriate  sign; 

(b)  when  the  absence  of  any  vowel  is  indicated  by  the  superscript  sign  u  (also  used  for  the 
vowel  a);  and 

(c)  when  the  following  modified  consonantal  forms  are  used: 

&  k  oti  n  nt  1  *  r 

gni  n  rib  t  oo  1 

4.  When  nn  appears  as  a  subscript  in  a  cluster,  it  is  transliterated  ta. 

5.  Exception:  Anusvara  is  transliterated  by: 

h  before  gutturals , 
n  before  palatals, 
n  before  cerebrals, 
n  before  dentals,  and 
m  before  labials. 


CHARACTER  MODIFIERS  IN  ROMANIZATION 


Character  modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

tilde 

E4 

□ 

macron 

E5 

□ 

dot  above 

E7 

□ 

dot  below 

F2 

□ 

underscore 

F6 
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Marathi 


Vowels  and  Diphthongs  (see  Note  1) 

Traditional  Style  New  Style  Romanization 


3t 

at 

a 

an 

arr 

a 

* 

fa 

i 

i 

aft 

T 

1 

at 

« 

U 

35 

at 

CN 

u 

at 

c 

r 

o 

r 

o 

1 

o 

** 

3T 

e 

at 

at 

e 

* 

£ 

ai 

art 

3ft 

o 

3TT 

art 

A 

o 

aft 

aft 

au 

Consonants  (see  Note  2) 

Gutturals  Palatals  Cerebrals  Dentals 


ka 

ca 

£ 

ta 

tt 

ta 

kha 

cha 

tha 

<=r 

tha 

*r 

ga 

at 

ja 

3 

da 

da 

gha 

5t 

jha 

dha 

dha 

5 

ha 

3t 

na 

na 

na 

Labials 


Semivowels  Sibilants 


Aspirate 


<T 

pa 

ya 

n 

sa 

pha 

T 

ra 

sha 

at 

ba 

at 

la 

tt 

sa 

bha 

la 

*t 

ma 

at 

va 
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Notes 


Anusvara  Visarga 

(see  Note  3) 


Avagraha 
(see  Note  4) 


□ 


□ 


h 


5 


) 

(apostrophe) 


1.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed;  the  forms  used  for 
vowels  following  a  consonant  can  be  found  in  grammars;  no  distinction  between  the  two  is 
made  in  transliteration. 

2.  The  vowel  a  is  implicit  after  all  consonants  and  consonant  clusters  and  is  supplied  in 
transliteration,  with  the  following  exceptions: 

(a)  when  another  vowel  is  indicated  by  its  appropriate  sign; 

(b)  when  the  absence  of  any  vowel  is  indicated  by  the  subscript  sign  (  N  )  called  halanta  or 
virama. 

3.  Exception:  Anusvara  is  transliterated  by: 

h  before  gutturals, 
h  before  palatals, 
n  before  cerebrals, 
n  before  dentals,  and 
m  before  labials. 

In  other  circumstances  it  is  transliterated  by  a  tilde  (  □  )  over  the  vowel. 

4.  When  doubled,  avagraha  is  transliterated  by  two  apostrophes  (  ”  ). 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character 

Name 

USMARC  hexadecimal  code 

9 

apostrophe 

27 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

circumflex 

E3 

□ 

tilde 

E4 

□ 

macron 

E5 

□ 

dot  above 

E7 

□ 

dot  below 

F2 

5 

circle  below 

F4 
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Mongolian 

(Classical  Mongolian  in  Vertical  Script) 


Initial 

Medial 

Final 

Romanization 

i 

(see  Note  1) 

^  (see  Note  1) 

a 

A 

■*  (see  Note  1) 

^  (see  Note  1) 

e 

A 

A 

o 

i 

A 

G  (see  Note  1) 

(see  Note  1) 

o  or  u 

i 

a  ^  (see  Note  1) 

(see  Note  1) 

6  or  u 

•a 

-4  A 

n 

*> 

3 

ng 

•9 

i 

i 

q 

i 

g 

«9 

*9 

S> 

b 

« 9 

• 9 

2> 

P 

*> 

*> 

f 

* 

* 

*  -c 

s 

v« 

* 

s 

f  (see  Note  2) 

*  1  (see  Note  2) 

(see  Note  2) 

t  or  d 

4i 

4J 

44 

1 

* 

4* 

4a 

m 

M 

M 

c 

/I 

S 

j 

A 

■A 

O 

y 

(see  Note  2) 

**  (see  Note  2) 

^  (see  Note  2) 

k  or  g 

o 

k 

<r> 

r 

V 

4 

* 

1 

h 

Notes 

1.  Romanize  medial  and  final  vowels  according  to  the  rules  of  vocalic  harmony. 

2.  Romanize  the  dental  and  velar  stops  according  to  the  pronunciation  given  in  standard  dictionaries. 
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TRANSCRIPTION  OF  GALIG  LETTERS 


Vowels  and  Diphthongs 

3 

3 

h 

i 

4 

4 

A 


a 

a 

i 

TT 

1 

U 

u 

r 


Consonants 


Gutturals  Palatals 


*> 

r 

o 

1 

% 

I 

o 

A 

e 

4 

ai 

1 

o 

L 

au 

Cerebrals 

Dentals 

3 

ka 

a 

ca 

si 

ta 

• 

a 

ta 

3 

kha 

a 

cha 

J 

tha 

3 

tha 

3 

ga 

a 

ja 

3 

da 

• 

da 

gha 

3 

jha 

3 

dha 

• 

dha 

ha 

.3* 

ha 

na 

na 

Labials 

Semivowels 

Sibilants 

Aspirate 

3 

pa 

j 

ya 

3 

sa 

ha 

3 

pha 

j 

ra 

3 

sa 

3 

ba 

3 

la 

3 

sa 

£ 

bha 

a 

va 

3 

ma 

Anusvara 

Visarga 

O 

m 

• 

• 

o 

h 

• 

CHARACTER  MODIFIERS  IN  ROMANIZATION 


Character  modifiers  Name 

acute 

□  tilde 

□  macron 

□  dot  above 

□  dot  below 

□  circle  below 


USMARC  hexadecimal  code 

E2 

E4 

E5 

E7 

F2 

F4 
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Moplah 

Moplah 

(in  Arabic  Script) 


Letters  of  the  Alphabet 


Consonants 

Initial 

Medial 

Final 

Alone 

Romanization 

Gutturals 

<  3, 

dL 

k 

4^ 

4-^ 

4^- 

4^ 

kh 

3 

£ 

g 

4* 

4^ 

4^- 

4^ 

gh 

A 

e 

i 

n 

Palatals 

£ 

c 

0? 

0*- 

4?- 

4^- 

ch 

e- 

£ 

j 

0^ 

0^ 

4^ 

4^ 

jh 

S 

n 

Cerebrals 

x 

x 

t 

2>1 

.ox 

.ox 

£>:» 

th 

J 

ls- 

> 

J 

d 

*>> 

£>j_ 

2>j 

dh 

J 

4- 

O- 

o 

n 

Dentals 

J 

dx 

Cj 

t 

4^ 

4-*- 

4-*- 

4^ 

th 

X 

d 

21 

.OX 

.ox 

2)1 

dh 

u 

O 

n 

Labials 

mJ 

-A. 

v- 

C-J 

P 

4rf 

4^ 

44- 

.  4* 

ph 

J 

4- 

*-4- 

b 

44- 

44- 

44 

bh 

JO 

a. 

f- 

? 

m 
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Vowels  and  Diphthongs  (see  Note) 

Initial 


T  <  I' 


1 


r 


3 


l 


3 


I 


Romanization 

a 

a 

i 

r 

l 

u 

u 

r 

• 

e 

e 

ai 

o 

5 

au 


NOTE:  Only  the  vowel  forms  that  appear  at  the  beginning  of  syllables  are  listed.  The  forms  used 
for  vowels  following  a  consonant  exclude  I  ( alif ),  except  in  the  case  of  a  (  T  or  I'  ).  No  distinction 
between  the  two  is  made  in  romanization. 


Initial 

Medial 

Final 

Alone 

Romanization 

Semivowels 

-J 

*• 

s? 

s? 

y 

^r 

JT 

j. 

r 

J 

1 

J- 

J 

1 

3 

3- 

> 

J 

V 

V 

v 

V 

r 

u> 

V 

W 

u> 

V 

«*> 

V 

tt 

<_*?- 

1 

3 

3~ 

3 

1 

Silibants 

mJjJ 

* 

s 

-uL 

ur- 

s 

-*jU 

s 
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Moplah 


Aspirate 


Initial 

Medial 

Final 

Alone 

Romanization 

ZD 

■6 

•Q  <  <*> 

ZD 

h 

Arabic  Letters  Representing  Non-Malayalam  Consonants 


Initial 

Medial 

Final 

Alone 

Romanization 

J 

c*. 

thh 

Z* 

c- 

C 

h 

■jifc. 

& 

t 

kh 

s 

i. 

is. 

i 

z 

J 

> 

Jr 

J 

z 

s 

-*e»- 

V>=>. 

z 

L 

Jo. 

Jo 

t 

h 

Ji» 

-L. 

Jb> 

z 

X 

•SK, 

c- 

t 

4  (ayn) 

X 

JSC. 

& 

t 

gh 

•A 

A. 

ufi. 

f 

A 

si. 

o- 

q 

- 

. 

6 

h 

RULES  OF  APPLICATION 


Letters  Which  May  Be  Romanized  in  Different  Ways  Depending  on  Their  Context 


1. 


fl  ,  □  ,  □  (jazm )  indicates  the  absence  of  any  vowel  following  the  letter  over  which  it  is  written.  It 
is  not  represented  in  romanization. 

prakrtam 


2. 


When  ( shaddah )  appears  over  any  consonant  or  digraph,  represent  it  in  romanization  by  doubling 
the  consonant  or  digraph. 

untakkappettat  xifkl'i.;  I 


3.  Romanize  v_5  (alif  maqsurah)  used  in  place  of  I'  to  represent  the  long  vowel  as  d. 

Musa 

‘Isa 
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4.  When  the  word  ending  in  ^  is  in  the  construct  state,  romanize  ^  as  t. 

Tarjamat  al-sirru  al-jalil  0  o 

Surat  al-Fatihah 

5.  When  medial  or  final,  romanize  *  ( hamzah )  as  ’ . 

Khulafa’ 

‘A’isah 
Za’ah 

6.  Romanize  the  Arabic  article  U'  (also  written  Jl )  as  al-. 

‘alaihi  al-salam 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

alif 

AE 

< 

ayn 

BO 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

tilde 

E4 

□ 

macron 

E5 

□ 

dot  above 

E7 

□ 

dot  below 

F2 

0 

double  dot  below 

F3 

□ 

underscore 

F6 

•A.  llrfe. 


\ 
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Non-Slavic 


Non-Slavic  Languages 

(in  Cyrillic  Script) 


Non-Slavic  languages  in  the  Cyrillic  script  make  use  of  many  letters  from  the  basic  Cyrillic  alphabet 
presented  below.  For  those  letters,  use  the  romanization  shown  in  the  table.  For  other  letters,  follow 
the  romanization  given  in  the  applicable  language  section  on  the  pages  that  follow.  Languages  are  listed 
in  alphabetical  order  by  name.  Dates  after  the  name  of  a  language  refer  to  the  year  the  Cyrillic  alphabet 
was  introduced  or  the  year  an  alphabet  was  modified  to  accommodate  spelling  reform. 

Upper  case  letters  Lower  case  letters 

Vernacular  Romanization  Vernacular  Romanization 


A 

A 

a 

a 

B 

B 

6 

b 

B 

V 

B 

V 

r 

G 

r 

g 

n 

D 

a 

d 

E 

E 

e 

e 

E 

•• 

E 

e 

e 

Zh 

X 

zh 

3 

Z 

3 

z 

M 

I 

n 

i 

Vi 

I 

n 

V 

i 

K 

K 

K 

k 

n 

L 

n 

1 

M 

M 

M 

m 

H 

N 

H 

n 

o 

O 

O 

o 

n 

P 

n 

P 

p 

R 

P 

r 

c 

S 

c 

s 

T 

T 

T 

t 

y 

U 

y 

u 

<i> 

F 

$ 

f 

X 

Kh 

X 

kh 

n 

f$> 

U 

t? 

4 

Ch 

H 

ch 

HI 

Sh 

in 

sh 

m 

.Shch 

m 

shch 

Td 

11  (hard  sign) 

T> 

"  (hard  sign) 

BI 

Y 

bl 

y 

B 

'  (soft  sign) 

b 

'  (soft  sign) 

3 

E 

3 

e 

K> 

ItJ 

K3 

1U 

if 

Tk 

a 

ia 
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Upper  case  letters  Lower  case  letters 

Vernacular  Romanization  Vernacular  Romanization 


Abaza  (1938) 


I  R 

' b  11  (hard  sign) 


Abkhaz  (1954) 


J? 

Gh 

y 

J 

0 

Ch 

$ 

CK 

3 

Z 

3 

152: 

S 

Kh 

K 

Q 

O 

W 

Ih 

Ph 

T) 

Th 

* 

H 

y 

h 

Ch 

0 

W 

5 

gh 

U 

j 

te 

ch 

cK 

3 

z 

3 

3z 

•5 

(O 

kh 

k 

q 

9 

w 

n? 

ph 

h 

(  Ti  ) 

th 

5 

U) 

R 

• 

h 

ts 

q 

(4) 

ch 

9 

w 

Adygei  (1938) 

I  H 


I  R 

•b  11  (hard  sign) 


Aisor 

d 

0 

J 

L 

Q 

S 

T 

h 


D 

A 

c/ 

I 

L 

Q 

S 

T 

H 


d 

9 

j 

1 

q 

s 

t 

h 


d 

a 

1 

1 

q 

s 

t 

R 


Altai  (1845  &  1922) 
J 

IT  OHO 

6 

y 


D 

6 

u 


j 

H  (fi) 


6 

y 


a 

ng 

6 

u 
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Non-Slavic 


Upper  case  letters 

Lower  case  letters 

Vernacular 

Romanization 

Vernacular 

Romanization 

Altai  (1938)  No  additional  characters. 

Altai  (1944) 

J 

D 

j 

a 

IT  (H’) 

K  (h’) 

fig 

6 

6 

•• 

o 

6 

•• 

y 

• 

U 

•• 

y 

• 

u 

Avar  (1938) 

- 

- 

i 

H 

- 

- 

T> 

u  (hard  sign) 

Azerbaijani  (1940) 

f 

Gh 

■F 

gh 

© 

A 

9 

a 

w 

I 

ft 

V 

1 

j 

I 

j 

« 

1 

K 

6 

K  (HC) 

• 

g 

0 

6 

e 

6 

Y 

u 

Y 

• 

u 

h 

H 

h 

R 

4 

j 

M 

j 

9 

’  (apostrophe) 

9 

’  (apostrophe) 

The  letters  J  ,  j  replaced  ft  in  1959. 

Balkar  (1936) 

9 

’  (apostrophe) 

9 

’  (apostrophe) 

“ 

• 

T> 

11  (hard  sign) 

Balkar  (1939)  No  additional  characters. 

Bashkir  (1939) 

F 

Gh 

F 

gh 

3 

fPl 

3 

tK 

K 

Q 

K  (k) 

q 

I* 

N(j 

K 

fig 

© 

6 

e 

6 

Q 

Th 

S 

th 

Y 

U 

Y 

• 

u 

h 

H 

h 

h 

0 

A 

3 

a 
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Upper  case  letters 

Lower  case  letters 

Vernacular 

Romanization 

Vernacular 

Romanization 

Buryat  (1939) 

e 

O 

e 

6 

y 

• 

u 

Y 

• 

u 

h 

H 

h 

R 

Chechen  (1862) 

3 

Gh 

£ 

gh 

IJ 

J 

0 

j 

h 

H 

h 

R 

J 

w 

I 

j 

V' 

i 

k 

Kh 

kh 

k 

• 

Q 

k 

• 

q 

n 

N 

4 

n 

Ph 

Q 

ph 

Q 

Q 

q 

q 

5 

Gh 

gh 

T 

Th 

i 

th 

± 

m 

& 

ft  (or  (?  ) 

Ts 

U  (or  (?) 

ts 

k  (or  f?  ) 

Ch 

m  (or  tT) 

ch 

Chechen  (1908) 

n 

Gh 

n 

gh 

h 

H 

h 

h 

J 

w 

I 

j 

V' 

i 

kh 

kh 

k 

Q 

k 

q 

H 

N 

n 

il 

Ph 

q 

Ph 

Q 

Q 

q 

q 

b 

Gh 

V 

gh 

T 

Th 

T 

th 

X 

ffi 

X 

u 

Ts 

w 

U 

ts 

4 

Ch 

M 

ch 

Chechen  (1938) 

I 

H 

I 

R 

- 

- 

T> 

11  (hard  si 
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Non-Slavic 


Upper  case  letters 

Lower  case  letters 

Vernacular 

Romanization 

Vernacular 

Romanization 

Chukchi  (1958) 

S 

Q 

S 

q 

d 

ng 

- 

- 

» 

’  (apostrophe) 

Chuvash  (Missionary) 

No  additional  characters. 

Chuvash  (1872,  modified  1923,  1926,  1933) 

A 

A 

a 

a 

E 

E 

e 

e 

s 

L 

n?  (n>) 

a 

1 

lb 

N 

i+  (h>  h,  ) 

h 

Q 

S 

S 

* 

s 

T 

T 

v 

T 

* 

t 

Ch 

ch 

f 

U 

y  ( y ) 

• 

u 

Chuvash  (1938) 

A 

A 

a 

a 

E 

E 

e 

e 

Q 

s 

S 

§ 

* 

s 

y 

u 

y 

u 

Dargwa  (Uslar) 

JE 

ae 

ae 

W 

W 

w 

w 

S 

Gh 

£ 

gh 

f 

Gh 

w 

r 

gh 

T) 

J 

* 

j 

3 

3 

dz 

h 

ft 

h 

K 

o 

Kh 

f, 

kh 

m 

h 

s 

H 

m 

h 

K 

J 

I 

j 

V 

i 

K 

Kh 

kh 

k 

Q 

k 

q 

K 

s 

K 

W 

K 

k 

n 

Ph 

a 

ph 

Q 

Q 

q 

q 

Gh 

gh 

T 

Th 

i 

th 

fSh 

ff 

tsh 

* 

Ch 

* 

ch 

n  (hard  sign) 
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Upper  case  letters 

Lower  case  letters 

Vernacular 

Romanization 

Vernacular  Romanization 

Dargwa  (1938) 

- 

i  -•  ■ 

I 

R 

- 

T> 

11  (hard  sij 

Dungan  (1952) 

0 

A 

9 

a 

J 

* 

j 

9 

H  (^) 

ng 

y 

U 

y 

u 

Y 

U 

y 

u 

Eskimo-Yuit  Dialect 

r 

Gh 

r’ 

gh 

K’ 

Q 

K’ 

q 

/I’ 

L 

n’ 

\ 

H’ 

I ^ 

H’ 

fig 

X’ 

H 

x’ 

R 

y 

W 

y 

w 

Even  (1937)  No  additional  characters. 

Even  (1959) 

9 

y 

fig 

© 

6 

e 

6 

0 

6 

© 

5 

Evenki  (1937)  No  additional  characters. 

Evenki  (1958) 

9 

y 

fig 

Gagauz--Bulgaria 

This  language 

is  romanized  according  to  the  Bulgarian  table. 

Gagauz--U.S.S.R.  (1957) 

A 

A 

a 

a 

6 

6 

0 

• 

0 

y 

u 

y 

• 

u 

Ingush  (1938) 

I 

H 

1 

R 

- 

- 

T> 

11  (hard  si: 
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Non-Slavic 


Upper  case  letters  Lower  case  letters 


Vernacular 

Romanization 

Vernacular 

Romanization 

Kabardian  (1936) 

. 

- 

9 

h 

- 

- 

T> 

"  (hard  sign) 

Kabardian  (1938) 

I 

H 

I 

K 

- 

- 

T> 

n  (hard  sign) 

Kalmyk  (1927)  No  additional  characters. 

Kalmyk  (1938) 

A 

A 

a 

a 

6 

Mf 

o 

•• 

o 

v!/ 

o 

y 

u 

y 

u 

Kalmyk  (1957) 

0 

A 

3 

a 

h 

• 

H 

h 

h 

J 

j 

- 

- 

H, 

ng 

e 

6 

e 

6 

Y 

0 

y 

u 

Karachay  (1938) 

w 

9 

y 

w 

- 

- 

T> 

"  (hard  sign) 

Karachay-Balkar 

* 

w 

9 

y 

w 

- 

- 

T> 

"  (hard  sign) 

Karakalpak  (1940) 

No  additional  characters. 

- 

t 

11  (hard  sign) 

Karakalpak  (revised 

ca.  1947) 

F 

Gh 

■F 

gh 

K 

Q 

q 

X, 

H 

x. 

R 
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Upper  case  letters  Lower  case  letters 

Vernacular  Romanization  Vernacular  Romanization 


Karakalpak  (1957) 

¥  Gh 

K,  Q 


© 

y 

y 

* 

0 


6 

Xj 

w 

H 

A 


-F 

k 

e 


gh 

q 

ng 

6 


Y 

y 

k 

9 


U 

W 

K 

a 


Karelian 

A 

6 

y 


A 

6 

u 


a 

6 


a 

6 


u 


Kazakh  (1940) 


0 

A 

¥ 

Gh 

Q 

I ^ 

0 

6 

Y 

U 

Y 

U 

h 

ft 

I 

T 

9 

a 

-F 

(  f) 

gh 

k 

(IS) 

q 

H, 

ng 

0 

6 

Y 

(y) 

u 

Y 

u 

h 

K 

i  7 


Khakass  (1893  missionary) 


••  • 

O  O 

••  • 

y  u 

j  5 


IT 


y 

j 


ng 

6 

u 

T- 

J 


Khakass  (1924-1927) 
J 

y 

6 


j 

u 

6 


j 

y 

o 

IT 

h 


1- 

J 

U 

6 

hg 

gh 
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Upper  case  letters 

Lower  case 

letters 

Vernacular 

Romanization 

Vernacular 

Romanization 

Khakass  (1939) 

- 

- 

-F 

gh 

i 

i 

i 

TT 

1 

- 

- 

H, 

ng 

6 

6 

6 

6 

y 

u 

y 

u 

- 

- 

9(H) 

j 

Khanty  (1937) 

n’  (jv) 

L’  (apostrophe) 

n’  (n‘) 

F  (apostrophe) 

IV b  (TVb) 

Lv  (apostrophe; 

n’b  (n‘b) 

V'  (apostrophe; 

soft  sign) 

soft  sign) 

Note:  In  1952  the  generic  Cyrillic-based  alphabet  for  Khanty  was  replaced  by 
separate  alphabets  for  each  of  the  four  individual  dialects. 


Khanty-Shuryshkary  Dialect  (1952) 


Khanty-Kazym  Dialect  (1952) 


A 


0 

0 


A 

A 

vi/ 

A 


/T  V 


e 

e 


6 

V!/ 

o 


Khanty-Surgut  Dialect  (1952) 
A  A 

S  Q 


e  o 


y 

0 


u 

A 


y 


ng 


a 

o 

•• 

3 

n’ 

y 

e 

e 


a 

a 

a 

r 

ng 

6 

C/ 

O 


a 

IS 

y 

0 

0 

y 

3 


a 

q 

ng 

5 

• 

o 

u 

a 

’  (apostrophe) 
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Upper  case  letters 

Lower  case  letters 

Vernacular 

Romanization 

Vernacular 

Roman 

Khanty- -Vakil  Dialect  (1952) 

A 

A 

a 

a 

Q 

*S 

q 

- 

- 

U 

ng 

6 

6 

6 

6 

© 

6 

e 

o 

© 

o 

© 

\£* 

O 

y 

u 

y 

u 

0 

A 

9 

a 

0 

\Z> 

A 

§ 

a 

-  • 

- 

y 

’  (a 

Kirghiz  (1940) 

n 

H 

ng 

© 

6 

e 

6 

Y 

u 

Y 

u 

Komi  (1938) 

- 

- 

i 

i 

6 

6 

6 

6 

Komi  (Molodtsov)  (1919) 

d 

D 

d 

d 

dj 

D 

& 

a 

Dzh 

dzh 

Z 

% 

s 

z 

% 

52 

3z 

J 

I 

j  ‘ 

1 

lb  (Jh) 

L 

a,  (n>) 

\ 

Hi  (Hi) 

N 

Kb  (h>) 

n 

6 

6 

o 

6 

c 

s 

c 

✓ 

s 

f 

T> 

s 

t 

Komi-Permyak  (Missionary) 

6 

o 

o 

6 

Komi-Permyak  (1938) 

- 

- 

i 

T 

O 

6 

o 

6 

(apostrophe) 
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Upper  case  letters 
Vernacular  Romanization 


Lower  case  letters 
Vernacular  Romanization 


Koryak 

k 

Q 

IS 

q 

m 

y 

ng 

- 

- 

9 

’  (apostrophe) 

Kumyk  (1938)  No  additional  characters. 

- 

- 

T> 

11  (hard  sign) 

Kurdish  (1946) 

t 

G’ 

r  (r’) 

g’ 

0 

A 

9 

a 

£ 

A’ 

3  (3’) 

a’ 

It 

K’ 

K  (KO 

k’ 

6 

6 

•• 

O 

6 

A 

P’ 

n  (n’) 

P’ 

R’ 

p  (pO 

T  (T*) 

r’ 

f 

T* 

t’ 

h 

H 

h 

ti 

H’ 

h  (h’) 

li’ 

k 

Ch’ 

M  (m») 

ch’ 

q  (Q) 

Q 

q 

q 

w 

W 

w 

w 

- 

- 

> 

’  (apostrophe) 

Lak  (1864) 

3 

Gh 

£ 

gh 

h 

H 

h 

K 

It 

o 

Kh 

f, 

kh 

fi 

Hw 

fi 

Hw 

h 

H 

m 

h 

K 

J 

w 

I 

j 

V 

1 

it 

S 

K 

w 

K 

k. 

Kh 

kh 

k 

Q 

k 

• 

q 

Q 

Q 

q 

q 

<? 

S 

s 

s 

T 

T 

X 

t 

T 

Th 

I 

th 

* 

m 

k 

IcR 

ft 

Ts 

9 

U 

ts 

IT 

Ts 

V 

ts 

<{ 

Ch 

ch 

Ch 

> 

M 

ch 

ill 

Sh 

LU 

sh 
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Upper  case  letters  Lower  case  letters 

Vernacular  Romanization  Vernacular  Romanization 


Lak  (1938) 


Lapp  (Missionary) 


Lapp  (1937) 


Lezghian  (1938) 


Lezghian  (Uslar) 
/E 
W 

3 

f 

3 

h 

h 

K 

j 

k 

K 

k 

R 

ti 

n 

oi 

T 

T 

y 

<{ 

* 


W 

Gh 

6h 

J 

152: 

H 

o 

Kh 

✓ 

H 

I 

K 

Kh 

• 

Q 

s 

Q 

P 

Ph 

Q 

Gh 

T 

Th 

o 

f§h 

Ts 

Ch 

Ch 


I  H 

t>  u  (hard  sign) 


H’ 

I 

"b 

ae 

w 

S. 

w 

r 

b 

3 

h 

?, 

m 

h 

j 

✓ 

K 

k 

w 

K 

n 

fl 

q 

e 

x 

T 

y 

ff 

V 

M 

* 


ng 


ng 

H 

"  (hard  sign) 

ae 

w 

gh 

gh 

j 

3z 

H 

kh 

K 

i 

k 

kh 

• 

q 

✓ 

q 

p 

ph 

q 

gh 

t 

th 

u 

tsh 

ts 

ch 

ch 

11  (hard  sign) 
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Non-Slavic 


Upper  case  letters  Lower  case  letters 


Vernacular 

Romanization 

Vernacular 

Romanization 

Lithuanian 

>K 

Z 

5K 

z 

J 

Pi 

j 

U 

c 

U 

c 

c 

M 

V 

c 

HI 

s 

LU 

V 

s 

• 

E 

t 

e 

Mansi  (1937) 

H> 

• 

ng 

Mansi  (1958) 

- 

y 

ng 

Mari--Meadow  Dialect  (Missionary) 

I5  E 

A 

IO 

•• 

e 

Mari-Meadow  Dialect  (1870’s  and  early  Soviet) 

A  A 

•• 

a 

a 

IT 

H1 

ng 

••  • 

O  O 

6 

6 

••  • 

y  u 

y 

u 

Mari--Meadow  Dialect  (1938) 

- 

:  - 

if 

ng 

6 

6 

6 

6 

y 

• 

u 

y 

u 

Mari-Mountain 

Dialect  (missionary) 

15 

E 

IO 

•• 

e 

Mari— Mountain 

Dialect  (1870’s  and  early  Soviet) 

A 

A 

•• 

a 

a 

IT 

H- 

ng 

6 

6 

o 

6 

y 

U 

•• 

y 

u 

bi 

Y 

•• 

bi 

y 
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Upper  case  letters 


Lower  case  letters 


Vernacular 

Romanization 

Vernacular 

Romanization 

Mari-Mountain  Dialect  (1938) 

A 

A 

•• 

a 

a 

•• 

O 

• 

O 

6 

6 

y 

• 

u 

y 

u 

bi 

Y 

bi 

y 

Moldavian  (Early  and  1924) 

- 

- 

» 

’  (apostrophe) 

Moldavian  (1937) 

>k 

J 

X 

j 

This  letter  was  added  to  the  Moldavian  alphabet  in  1958. 

Mongolian  (1941-1945) 

0 

6 

© 

6 

Y 

• 

u 

y 

u 

Mordvin-Erzya  Dialect 

(Early  missionary) 

l5 

E 

IO 

•• 

e 

Mordvin-Erzya  Dialect 

(Later  missionary) 

IT 

H- 

ng 

Mordvin-Erzya  Dialect 

(Early  Soviet  &  1938) 

No  additional  characters. 

Mordvin-Moksha  Dialect  (Missionary) 

SB 

A 

a 

Mordvin-Moksha  Dialect  (1923) 

E 

E 

e 

e 

IK 

LkK 

nc 

lkh 

IT 

H 

fig 

6 

6 

o 

5 

K 

Rkh 

V 

rldi 

bi 

Y 

bi 

y 

3 

A 

•• 

3 

a 

Mordvin-Moksha  Dialect  (1938)  No  additional  characters. 
Nanai  No  additional  characters. 


ALA-LC  Romanization  Tables 


page  151 


Non-Slavic 


Upper  case  letters  Lower  case  letters 

Vernacular  Romanization  Vernacular  Romanization 


Nenets  (1937) 


’  (apostrophe) 


Netets  (1958) 

Nivkh 

Nogai  (1938)  No  additional  characters. 

Ossetic-Digor  Dialect  (1938) 

A  A 

Ossetic--Iron  Dialect  (1938) 

A  A 

Note:  When  used  alone,  the  language  name 

Romany-Bulgaria 

p  n 

Romany-U.S.S.R. 

r ( f  )  Gh 

Selkup  (1936) 

Shor  (Missionary) 


Shor  (1927) 


Shor  (1938) 


y 

ng 

’  (apostrophe) 

ae 

» 

ae 

’  (apostrophe) 

< 

‘  (ayn) 

11  (hard  sign) 

ae 

ae 

"b 

n  (hard  sign) 

Ossetic"  usually  refers  to  this  dialect. 

P 

l 

r  (-f) 

gh 

> 

’  (apostrophe) 

y 

*  . 
u 

6 

6 

H" 

ng 

j 

ch 

IT 

ng 

6 

6 

y 

u 

y 

u 

6 

•o 
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Upper  case  letters  Lower  case  letters 


Vernacular 

Romanization 

Vernacular 

Romanization 

Tabasaran  (1938) 

-■ 

- 

I 

h 

- 

- 

T> 

"  (hard  sign) 

Tajik  (1940) 

¥ 

Gh 

gh 

- 

- 

n 

t 

l 

K, 

Q 

k 

q 

y 

U 

y 

u 

x, 

ft 

k 

H 

H 

J 

H 

j 

- 

- 

•b 

11  (hard  sign) 

Tat 

- 

- 

I 

K 

- 

- 

T> 

"  (hard  sign) 

Tatar  (1938) 

0 

A 

9 

a 

e 

6 

0 

6 

Y 

u 

Y 

u 

j 

>k  (^) 

j 

ft 

H  (d) 

ng 

h 

H 

h 

H 

Tatar--Crimea  No  additional  characters. 

- 

- 

T> 

11  (hard  sign) 

Tatar- Kryashen 

A 

A 

a 

a 

- 

- 

H" 

ng 

6 

6 

O 

6 

y 

u 

y 

u 

Turkmen  (1940) 

* 

j 

j 

ft 

H 

ng 

e 

6 

e 

6 

Y 

u 

Y 

u 

0 

A 

9 

a 
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Upper  case  letters 

Lower  case  letters 

Vernacular 

Romanization 

Vernacular 

Romanization 

Tuva 

i 

- 

K 

ng 

e 

6 

e 

6 

Y 

u 

Y 

u 

Udekhe 

- 

- 

9 

’  (apostrophe) 

Udmurt  (missionary) 

6 

6 

6 

6 

Udmurt-Early  Soviet  and  1937  alphabets) 

at 

j 

•• 

5K 

j 

3 

152: 

3 

dz 

- 

- 

•• 

U 

V 

l 

6 

6 

O 

6 

M 

Ch 

M 

ch 

Uighur  (1947) 

F 

Gh 

■F 

gh 

0 

A 

9 

a 

J 

>k 

j 

K, 

Q 

k  (O 

q 

H 

NCj 

H 

ng 

e 

6 

e 

6 

Y 

• 

u 

Y 

ix 

h 

H 

h 

R 

Uzbek  (1940) 

-F 

Gh 

■F 

gh 

K 

Q 

k  (k) 

q 

y 

U 

y  (y) 

u 

* 

H 

k  (k) 

R 

Yakut  (1819)  No  additional  characters. 
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Upper  case  letters 

Lower  case  letters 

Vernacular 

Romanization 

Vernacular 

Romanization 

Yakut  (1851  Bohtlingk) 

A 

E 

•• 

a 

• 

e 

6 

• 

O 

•• 

o 

• 

o 

y 

• 

u 

•• 

y 

• 

u 

Gh 

gh 

n 

H  (w) 

ng 

V 

t> 

U 

a 

6 

H  (H*) 

n 

j 

\2/ 

I 

j 

1 

f 

ffr 

t 

O 

in 

L 

L 

1 

i 

- 

- 

h 

K 

A,  etc. 

AA,  etc. 

a,  etc. 

aa,  etc. 

Yakut  (1939) 

F 

Gh 

5 

gh 

H 

w 

ng 

© 

6 

e 

• 

o 

h 

H 

h 

K 

Y 

U 

Y 

• 

u 

SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

apostrophe 

27 

JE 

digraph  AE  (upper  case) 

A5 

/ 

soft  sign 

A7 

c 

ayn 

BO 

ae 

digraph  ae  (lower  case) 

B5 

// 

hard  sign 

B7 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

macron 

E5 

□ 

breve 

E6 

□ 

dot  above 

E7 

□ 

umlaut  (dieresis) 

E8 

e 

hachek 

E9 

a 

circle  above  (angstrom) 

EA 

G 

ligature,  1st  half 

EB 

0 

ligature,  2nd  half 

EC 

a 

candrabindu 

EF 

g 

cedilla 

FO 

□ 

dot  below 

F2 
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Vowels  and  Diphthongs  (see  Note  1) 


a 

a 

Q 

€) 

9 

Q 

9 


a 

a 

i 

*T* 

1 

U 

u 

r 


9 

•c 

<§ 

51 


r 

1 

o 

e 

ai 

o 

au 


Consonants  (see  Note  2) 

Gutturals 

Palatals 

Cerebrals 


Dentals 


Q 

ka 

Q 

ca 

a 

kha 

e 

cha 

9 

ga 

© 

ja 

Q 

gha 

$ 

jha 

& 

ha 

s 

ha 

$ 

ta 

© 

ta 

O 

tha 

• 

a 

tha 

@ 

da 

9 

da 

9 

dha 

• 

u 

dha 

€1 

na 

• 

Q 

na 

Labials 


Semivowels 


Sibilants 


Aspirate 


0 

pa 

0 

ya 

€1 

sa 

Q 

pha 

q 

ya 

9 

sha 

9 

ba 

Q 

ra 

9 

sa 

9 

bha 

fi) 

la 

q 

ma 

$ 

la 

9 

ba 

(see  Note  3) 

Anusvara 

Bisarga 

Candrabindu 

Abagraha 

(see  Note  4) 

(anunasika) 

(see  Note 

(see  Note  5) 

0  m 

0 

8  h 

• 

□  n,m 

*  , 

(apostrophe) 
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Notes 

1.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed;  the  forms  used  for 
vowels  following  a  consonant  can  be  found  in  grammars;  no  distinction  between  the  two  is 
made  in  transliteration. 

2.  The  vowel  a  is  implicit  after  all  consonants  and  consonant  clusters  and  is  supplied  in 
transliteration,  with  the  following  exceptions: 

(a)  when  another  vowel  is  indicated  by  its  appropriate  sign;  and 

(b)  when  the  absence  of  any  vowel  is  indicated  by  the  subscript  sign  (  v  )  called  hasanta. 

3.  q  is  used  both  as  a  labial  and  as  a  semivowel.  When  it  occurs  as  the  second  consonant  of 
a  consonant  cluster,  it  is  transliterated  va.  When  q  is  doubled,  it  is  transliterated  bba. 

4.  Exception:  Anusvara  is  transliterated  by: 

h  before  gutturals, 
n  before  palatals, 
n  before  cerebrals, 
n  before  dentals,  and 
m  before  labials. 

5.  Candrabindu  before  gutteral,  palatal,  cerebral,  and  dental  occlusives  is  transliterated  n. 
Before  labials,  sibilants,  semivowels,  the  aspirate,  vowels,  and  in  final  position  it  is 
transliterated  in. 

6.  When  doubled,  abagraha  is  transliterated  by  two  apostrophes  (  ”  ). 

SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character 

Name 

USMARC  hexadecimal  code 

9 

apostrophe 

27 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

tilde 

E4 

□ 

macron 

E5 

□ 

dot  above 

E7 

\!/ 

□ 

candrabindu 

EF 

□ 

dot  below 

F2 

□ 

circle  below 

F4 
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Ottoman  Turkish 


RULES  OF  APPLICATION 

1.  Ottoman  Turkish  is  meant  here  to  mean  the  Turkish  language  as  found  in  written  records  of  the 
area  controlled  by  the  Ottoman  Empire  (ca.  1300  to  1919)  and  in  use  in  Turkey  from  1919  until 
the  adoption  of  the  Roman  alphabet  (officially  introduced  in  1928).  Ottoman  Turkish  covers 
Turkish  written  in  the  Arabic  alphabet  and  in  other  non-Roman  alphabets  as  well  (e.g., 
Armenian,  Cyrillic,  Greek,  Hebrew).  It  does  not  include  the  other  Turkic  languages  of  Eastern 
Europe,  the  Near  East  and  Central  Asia. 

2.  In  romanizing  Ottoman  Turkish  the  principle  of  conversion  is  applied  as  far  as  possible,  i.e.,  the 
word,  phrase,  name,  or  title  being  romanized  is  represented,  if  possible,  by  the  form  it  has  in 
modern  Turkish  orthography,  even  if  that  means  converting  some  letters  to  their  modern 
equivalents.  Foreign  words,  or  words  of  non-Turkish  origin  that  have  become  loan-words  in 
Turkish,  are  converted  like  Ottoman  Turkish.  Modern  Turkish  usage  in  capitalization  is 
followed. 

3.  Some  limitations  to  the  applicability  of  conversion  are  recognized.  When  the  orthography  of  the 
Ottoman  Turkish  original  reveals  a  conflict  with  modern  Turkish  usage,  whether  in 
pronunciation,  in  syntax,  or  in  vocabulary,  the  letter  (letters),  word,  or  expression  is  romanized 
according  to  the  table.  Conversion  as  a  principle  is  not  applied  to  word  order,  which  is  not 
changed  to  conform  to  modern  syntax.  Obsolete  terms  are  not  replaced  by  their  modern 
equivalents  to  reflect  current  terminology.  Paragraphs  5-8  below  provide  more  detailed  guidance. 

4.  Some  variations  in  modern  Turkish  orthography  are  found  among  standard  publications.  The 
reference  works  listed  below  are  recommended  for  guidance  in  romanization. 

Dictionaries  and  Spelling  Guides 

Buyiik  lugat  ve  ansiklopedi  ("Meydan  Larousse").  1969-1973. 

Hony,  H.C.  A  Turkish-English  dictionary.  2nd  ed.  1957. 

Redhouse  yeni  Turkqe-Ingilizce  sozliik.  1968. 

Turk  Dil  Kurumu.  Yeni  imla  kdavuzu.  3rd  ed.  1967  (And  other  editions) 

Turkey.  Mill!  Egitim  Bakanligi.  Turk  ilmi  transkripsyon  kdavuzu.  1946. 

Specialized  Works 

Islam  ansiklopedisi.  1940- 

Istanbul.  Universite.  Kutiiphane.  Istanbul  Universitesi  Kutuphanesi  Tiirkge  basmalar  alfabe 
katalogu  (1729-1928).  1956. 

Koray,  Enver.  Tiirkiye  tarih  yaymlan  bibliyografyasi,  172911955- 
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Turkiye  bibliyografyasi.  1934- 

Chapman,  Harry  W.  1939-1948  Turkiye  bibliyografyasi  dzad  indeski  =  A  First-name  index  to 
the  Turkiye  bibliyografyasi,  1939-1948.  1968. 


When  the  Turkish  sources  do  not  agree,  the  usage  which  seems  predominant  is  followed.  For 
certain  kinds  of  variation,  a  uniform  treatment  is  adopted.  (See  paragraphs  5-8  below.) 

A§ik  Pa§a  zade  or  A§ik  Pa§azade  or  A§ikpa§azade 

Heading:  A§ikpa§azade 

When  the  word  or  expression  being  looked  for  is  not  found  in  Turkish  sources,  it  is  romanized 
by  analogy  with  similar  words  or  expressions.  References  are  freely  made  when  they  will  help 
to  guide  a  user  of  the  catalog  from  forms  found  in  published  sources  to  the  form  that  has  been 
adopted  for  the  catalog.  Variants  from  which  references  are  made  may  be  either  names  or  the 
titles  of  works.  References  are  especially  useful  when  the  title  may  be  interpreted  as  being  in 
any  of  several  languages.  Added  entries  may  be  provided  instead  of  references  when  appropriate. 

5.  Phonetic  considerations  contribute  to  variability  and  inconsistency  in  the  spelling  of  certain  words 
and  classes  of  words.  For  cataloging  purposes,  the  orthography  specified  below  is  adopted  in  the 
conversion  of  both  words  and  names. 

(a)  Final  -p  versus  final  -b:  the  spelling  p  is  adopted  for  all  cases  in  which  p/b  variation  occurs. 

tip  not  tib 
harp  not  harb 
Ratip  not  Ratib 

but  Bab-i  Saadet  [no  variation] 

(b)  Final  -t  versus  final  -d:  the  spelling  t  is  adopted  for  all  cases  in  which  tld  variation  occurs. 

mevcut  not  mevcud 
Ahmet  not  Ahmed 
but  Bel  grad  [no  variation] 

(c)  Final  -k  versus  final  -g:  The  spelling  k  is  adopted  for  all  cases  in  which  k/g  variation  occurs. 

renk  not  reng 

(d)  Final  -g  versus  final  -c:  the  spelling  g  is  adopted  for  all  cases  in  which  g/c  variation  occurs. 

burg  not  burc 

(e)  Medial  -tt-  versus  medial  -dd-:  the  spelling  dd  is  adopted  for  all  cases  in  which  variation 
occurs. 

Bedreddin  not  Bedrettin 

(f)  Medial  -iy-  versus  -iyy-:  the  spelling  -iy-  is  adopted  for  all  cases,  whether  the  particular  word 
concerned  is  variously  spelled  or  not. 

harbiye  not  harbiyye 
cumhuriyet  not  cumhuriyyet 
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6. 


Other  variations  require  a  standardized  orthography  for  cataloging  purposes. 

(a)  Izafet.  Final  i  or  1  preceded  by  a  hyphen,  the  combination  functioning  as  the  sign  of  izafet, 
may  otherwise  be  found  in  the  form  i  or  1  (sometimes  yij  yi)  added  directly  to  the  word  being 
modified.  In  converting  from  Ottoman  Turkish,  -i  is  added  to  the  word  being  modified  when 
it  ends  in  a  consonant,  -yi  when  it  ends  in  a  vowel.  Vowel  harmony  is  disregarded. 

Devlet-i  Aliye-yi  Osmaniye 

Izafet  is  not  indicated  between  the  elements  of  a  personal  name  except  when  it  is  expressly 
indicated  in  the  Ottoman  Turkish  orthography. 


(b)  The  Arabic  article  al,  when  romanized  in  modern  Turkish,  may  take  various  forms,  among 
them:  al,  el,  ur,  ill,  %  iis,  ed,  en,  and  et.  When  the  article  functions  as  the  middle  element  of 
a  phrase  or  compound  word  or  name,  within  a  Turkish  context,  it  is  converted  in  accordance 
with  the  predominant  usage  in  modern  Turkish  orthography  for  the  particular  word  or  name. 
It  will  usually  happen  that  the  component  elements  are  combined  as  a  single  word, 
alelumum  Abdiissettar 

bilhassa  Abdurrahman 

maalesef  Ebulgazi 

dariilfunun  x-reffrom  Ebiil  Gazi 


When  the  Arabic  article  occurs  in  an  Arabic  title,  it  is  romanized  according  to  the  rules  for 
Arabic.  (See  (9)  below.) 


(c)  Diacritics.  A  dieresis  (LJ)is  retained  if  found  to  be  commonly  used  in  Turkish  in  the 
particular  word  concerned,  even  if  it  is  not  used  consistently.  A  circumflex  (□)  is  retained 
where  found  in  modern  Turkish  usage.  It  is  always  placed  over  the  final  long  i  of  the  relative 
adjective  (nisbe)  and  names  derived  from  it. 


(d)  An  apostrophe  (’)  is  inserted  between  a  proper  noun,  or  a  number,  and  a  suffixed  inflection. 
Mustafa’nin  vakfi 
1975’te 
1976’de 


7.  Non-Turkish  titles.  Rarely,  an  Ottoman  Turkish  work  will  be  found  to  bear  a  title  that  is 
entirely  non-Turkish.  "Non-Turkish"  should  not  be  taken  to  refer  to  any  word  or  phrase  that 
is  only  foreign  in  origin  but  has  been  accepted  as  belonging  to  Turkish.  (This  caveat  is 
particularly  necessary  for  words  and  expressions  that  derive  from  Arabic,  the  majority  of 
which  have  become  Turkish.)  As  already  indicated  in  the  present  scheme,  Turkish 
lexicographic  sources  will  settle  most  such  questions.  In  the  very  rare  case  when  a  title  is 
non-Turkish  (i.e.,  the  words  and  phrases  used  have  not  been  accepted  into  the  Turkish 
language),  then  the  title  may  be  romanized  according  to  the  rules  for  the  language  involved. 

8.  General  notes. 


(a)  Word  divisions  follow  predominant  usage  in  modern  Turkish.  Where  word  division  in  the 
original  varies  decidedly  from  modern  style  and  it  is  desirable  to  reflect  the  presentation  in 
the  publication  being  cataloged,  the  Ottoman  Turkish  word  division  may  be  retained. 
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(b)  The  Arabic  word  ibn ,  "son  of',  occurring  in  names  of  the  Islamic  type,  usually  appears  in 
modern  Turkish  orthography  as  Ibn  at  the  beginning  of  a  name  and  as  bin  (often  abbreviated 
b.)  in  the  middle  of  a  name.  The  Ottoman  Turkish  form  is  romanized  so  as  to  maintain  this 
distinction  uniformly,  but  the  abbreviation  b.  is  not  used. 

9.  When  conversion  is  impractical  because  the  word  or  words  to  be  romanized  are  not  part  of  the 
lexicon  of  modern  Turkish  and  cannot  be  documented  in  Turkish  sources  in  roman  orthography, 
and  analogous  forms  are  not  available  for  guidance,  the  orginal  is  romanized  letter  by  letter,  as 
prescribed  by  the  rules  for  romanization  of  the  alphabet  concerned.  When  a  title  is  being 
romanized  and  the  language  of  the  title  is  not  identifiable  with  certainty,  the  title  is  romanized 
as  from  Ottoman  Turkish  according  to  the  table.  It  should  be  noted  that  letter-by-letter 
romanization  is  applied  only  to  the  word  or  expression  which  cannot  be  converted  to  modern 
Turkish  orthography.  It  may  only  be  necessary  to  transliterate  a  single  letter  or  group  of  letters 
within  a  word  which  is  otherwise  convertible  to  modern  Turkish.  The  associated  words  in  a 
context  are,  as  far  as  possible,  romanized  according  to  the  principles  of  conversion. 

If  the  Ottoman  Turkish  original  is  written  in  the  Arabic  alphabet,  the  consonants  are  romanized 
as  in  the  Ottoman  Turkish  table  presented  in  this  section. 

The  vowels  and  orthographic  symbols  other  than  vowel  signs  ( hemze ,  \edde,  medde,  tenvin )  are 
romanized  according  to  the  rules  for  Arabic  or  Persian,  as  appropriate.  They  are  supplied  if 
omitted  from  the  Turkish  original.  (Systematic  romanization  in  these  cases  promotes 
reversibility.  See  the  romanization  tables  for  Arabic  and  Persian.)  No  attempt  is  made  to 
represent  the  Ottoman  Turkish  pronunciation  of  vowels  beyond  what  is  conveyed  by  the 
orthography  of  the  orginal. 

10.  The  romanization  table  for  Ottoman  Turkish  (presented  on  the  next  two  pages)  is  based  on  the 
romanization  system  published  by  Eleazer  Birnbaum  in  the  Journal  of  the  American  Oriental 
Society,  volume  87  (1967).  A  few  slight  modifications  have  been  introduced. 
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10.  Romanization  Table  for  Ottoman  Turkish 

NOTE:  Use  table  if  principle  of  conversion  cannot  be  applied.  See  2.  in  RULES  OF 
APPLICATION. 

Letters  of  the  Alphabet 


Initial 

Medial 

Final 

Alone 

Romanization 

1 

L 

L 

1 

omit  (see  Note  1) 

J 

C-J 

b 

mJ 

P 

J 

- 

Cli 

t 

Id 

G 

s 

c 

c 

2*U 

s 

S 

c- 

c 

h 

• 

c 

h 

2k- 

X 

d 

i 

i- 

X 

i 

z 

j 

jr 

j- 

V 

r 

J 

Jr 

J- 

J 

z 

J 

Jr 

Jr 

0 

j 

mmJjJ 

mUJm 

^r>~ 

s 

JJm 

O'' 

O' 

§ 

s 

O3* 

O3 

z 

L 

L>. 

JL. 

«L 

t 

L 

Jx 

Jx 

Ji 

z 

X 

A 

e- 

6 

‘  (ayn) 

X 

A 

e- 

6 

g 

3 

-L 

o 

f 

3 

iL 

L>- 

o 

k 

el 

el 

k 

g  (see  Note  2) 

- 

el 

eJ 

n  (see  Note  3) 

J 

1 

J- 

J 

1 

JQ 

jO. 

h 

P 

m 

J 

-L 

O* 

O 

n 

3 

3- 

> 

j 

v  (see  Note  4) 

J2> 

•e 

o  <  o 

h  (see  Note  5) 

mJ 

*3 

y  (see  Note  6) 
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Notes 

1.  For  orthographic  uses  of  this  letter,  see  the  Arabic  and  Persian  romanization  tables. 

2.  Frequently  written  without  the  distinguishing  upper  strokes. 

3.  Frequently  written  without  the  distinguishing  dots. 

4.  For  the  uses  of  this  letter  other  than  to  represent  the  consonantal  sound  v,  see  the  Persian 

romanization  table. 

5.  For  the  distinction  between  o  and  &  see  the  Arabic  romanization  table. 

6.  For  the  uses  of  this  letter  other  than  to  represent  the  consonantal  sound  y,  see  the  Persian 

romanization  table. 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMAR 

/ 

soft  sign  (prime) 

A7 

J 

alif 

AE 

< 

ayn 

BO 

1 

Turkish  i  (small) 

B8 

Character  Modifiers 

Name 

USMAR 

6 

acute 

E2 

□ 

circumflex 

E3 

□ 

tilde 

E4 

□ 

macron 

E5 

□ 

breve 

E6 

6 

dot  above 

E7 

□ 

umlaut  (dieresis) 

E8 

5 

cedilla 

FO 

□ 

• 

dot  below 

F2 

0 

underscore 

F6 

0 

upadhmaniya 

F9 
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Pali 

(in  Various  Scripts) 


Notes 

1.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed;  the  forms  used 
for  vowels  following  a  consonant  can  be  found  in  grammars;  no  distinction  between  the  two 
is  made  in  transliteration. 

2.  The  vowel  a  is  implicit  after  all  consonants  and  consonant  clusters  and  is  supplied  in 
romanization,  except  when  another  vowel  is  indicated  by  its  appropriate  sign. 

3.  Exception:  NiggahTta  and  sahhaka  combinations  representing  nasals,  are  romanized  by: 

h  before  gutterals,  h  before  palatals, 

n  before  cerebrals,  n  before  dentals,  and 

m  before  labials. 

4.  In  Bengali  script,  ba  and  va  are  not  differentiated.  The  romanization  should  follow  the 
value  of  the  consonant  in  the  particular  passage,  ascertainable  by  checking  the  same  passage 
as  printed  in  other  scripts. 


Romanization 

Bengali 

Burmese 

Vowels  (see  Note  1) 

a 

S3 

a 

3ft 

333 

i 

CO 

n. 

T" 

1 

d 

U 

e 

0 

u 

e 

e 

e 

o 

'Q 

ego 

Consonants 

(see  Note  2) 

Gutturals 

ka 

03 

kha 

9 

ga 

0 

gha 

sr 

CO 

ha 

$ 

c 

Palatals 

ca 

5 

cha 

so 

ja 

c 

jha 

na 

43 

£ 
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Devanagan  Sinhalese  Thai 


3T 

a, 

3TT 

qo 

s> 

a 

i 

3 

a 

c 

a 

c* 

9 

a 

(5 

u 

ia 

3ft 

© 

la 

*p 

22) 

n 

3 

7J 

IT 

CD 

FI 

*r 

ca 

'W 

© 

4 

6 

d 

ft 

ar 

d 

<3 

22© 

04 

ar 
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Romanization 

Khmer 

Lao 

Vowels  (Independent)  (see  Note  1) 

a 

H 

3 

a 

dl 

3? 

i 

l 

o 

9 

T 

rv 

1 

u 

9 

U 

3 

? 

u 

X  z 

3 
«  ■ 

e 

h 

f3 

o 

6,  5 

\d 

Tua  Tham/A 

Tua  Tham/B 

Northern 

*3 

. 

S35 

- 

eo 

o 

3 

9 

SJ 

S* 

8 

& 

& 

& 

e 

& 

% 

- 

Vowels  (Dependent)  (see  Note  1) 


a 

□ 

□ 

cT 

tf 

a 

cn 

□  2 

d* 

D° 

D° 

i 

a 

o 

□ 

□ 

a 

D 

TT 

1 

s 

rw 

□ 

a 

a 

<v 

D 

U 

□ 

Pl 

U 

□ 

\J 

□ 

u 

□u 

□a 

e 

id 

CD 

CD 

CD 

CD 

o 

tm 

'Id 

CD"* 

CD"5 

& 

Consonants  (see  Note  2) 


Gutturals 


ka 

n  D 

A 

n 

<r> 

CO 

cn 

kha 

2  D 

e> 

5 

9 

3 

3 

ga 

D 

a 

a 

r> 

o 

o 

gha 

w  nj 

'20 

*5LL> 

9x> 

’ey 

na 

6  o 

V 

? 

tn 

(S) 

Palatals 

ca 

AA  __ 

u  8 

5> 

0 

o 

£> 

cha 

Ip  D 

Ja 

a 

£> 

oo 

OJ 

ja 

ti  D 
o 

? 

CN 

EJv 

9 

jha 

? 

$1 

na 

CD  S’g 

OJ 

£ 

£ 

e5 
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Romanization 

Bengali 

Burmese 

Devanagari 

Cerebrals 

ta 

T> 

R 

z 

tha 

S 

z 

da 

$ 

3 

¥ 

dha 

U 

o 

z 

na 

•f 

OD 

Dentals 

ta 

Ss 

CO 

tha 

00 

«T 

da 

ff 

3 

dha 

*t 

0 

«r 

na 

51 

? 

Labials  (see  Note  4) 

pa 

o 

q 

pha 

cs 

q> 

ba 

* 

o 

sr 

bha 

CO 

^r 

ma 

51 

0 

*T 

Semivowels  (see  Note  4) 

ya 

CO 

q 

ra 

? 

s 

la 

5T 

CO 

}a 

s 

o5 

va 

3 

o 

q 

Sibilant 

sa 

>1 

CO 

¥ 

Aspirate 

ha 

5 

CO 

Sinhalese 


0 
G j 

Q 

eS 

•€£> 


235 

O 

Q 

2T5 


£3 

6 

a 

to 

® 


G3 

d 

e 

© 

Q 


es 


eo 


Niggahita  (see  Note  3) 

fi  m 


Visagga 
:  h 


Thai 


9] 

Cl 

VI 

s 

u 


u 

vt 

n 

3J 


fj 

“5 

vr 

'j 


Vi 
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Romanization 

Khmer 

Lao 

Tua  Tham/A 

7wa  Tham/B 

Northern 

Cerebrals 

ta 

n  □ 

S) 

li 

& 

83 

13 

tha 

u1  □ 

ct 

n 

9 

8 

8 

da 

*  □ 
y 

in 

<? 

* 

dha 

w  SJ 

OJ 

<2J5 

so 

so 

na 

an  □ 

cvn 

m 

on 

CO 

CV> 

Dentals 

ta 

R  □ 

o 

o 

on 

on 

on 

tha 

t!  □ 

c* 

? 

Q 

G 

C) 

da 

5  □ 

m 

© 

G 

© 

dha 

2  □ 

£> 

O 

O 

© 

na 

8  □ 

V 

o. 

e. 

c 

Labials 

pa 

u  OJ 

\3 

<J 

0 

iT 

pha 

t!  0 
u 

C5 

a 

6J 

to¬ 

ba 

n  □ 

ti 

OJ 

eo 

CO 

co 

bha 

□ 

n 

n 

o 

33 

JO 

ma 

u  □ 

-6 

u 

9 

4 

Semivowels 

ya 

tu  DJ 

0 

U3 

a> 

to 

ra 

J  (u 

s 

S 

vs 

5 

la 

fU  □ 

a? 

CO 

OJ 

la 

fl  - 

6 

S' 

£ 

o 

f?) 

va 

t  n 

3 

o 

o 

O 

Sibilant 

sa 

fd  DJ 

3 

AJ 

Si 

XU 

Aspirate 

ha 

tn  □ 

m 

cn 

ID 

in 

m 

Niggahita  (see  Note  3) 

Visagga 

□  m 

:  h 
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SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character 

Name 

USMARC  hexadecimal  code 

apostrophe 

27 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

□ 

tilde 

E4 

□ 

macron 

E5 

□ 

dot  above 

E7 

□ 

dot  below 

F2 
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Panjabi 

(in  Gurmukhi  Script) 


Vowels  and  Diphthongs  (see  Note  1) 


»n 

t 

i 


a 

a 

i 

t- 

l 

u 

u 


§  e 

f  ai 

§  o 

>>i  au 


Consonants  (see  Notes  2  and  3) 


Sibilants 


Aspirate 


Gutturals  Palatals 


Tt 

sa 

TJ 

ha 

3 

ka 

3 

ca 

sha 

¥ 

kha 

S 

cha 

M 

kha 

ft 

ja 

31 

ga 

a 

za 

31 

gha 

¥ 

jha 

m 

gha 

¥ 

na 

5 

ha 

Cerebrals 

Dentals 

Labials 

Semivowels 

3 

ta 

3 

ta 

y 

pa 

Cl 

ya 

3 

tha 

¥ 

tha 

3 

pha 

3 

ra 

¥ 

da 

da 

3 

fa 

7? 

la 

¥ 

dha 

¥ 

dha 

¥ 

ba 

7? 

la 

5 

na 

75 

na 

3 

bha 

3 

wa 

W 

ma 

3 

ra 

Bind 1 

Tippi 

Adhik 

(see  Note  4) 

(see  Note  5) 

(see  Note  6) 

& 

m 

n 

m 

[doubles  the 

following 

consonant] 
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Notes 


1 .  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed;  the  forms  used  for 
vowels  following  a  consonant  can  be  found  in  grammars;  no  distinction  between  the  two  is 
made  in  transliteration. 

2.  The  vowel  a  is  implicit  after  consonant  clusters  and  may  be  implicit  after  consonants  except 
when  they  are  final  or  when  another  vowel  is  indicated  by  its  appropriate  sign.  The  cases  in 
which  the  vowel  a  is  implicit,  however,  can  be  determined  only  from  a  knowledge  of  the 
language  or  from  suitable  reference  sources.  In  such  cases  the  a  is  supplied  in  transliteration. 

3.  The  dotted  letters  are  used  in  Urdu  words. 

4.  Exception:  Bind!  is  transliterated  by: 


h  before  gutturals, 
n  before  palatals, 
n  before  cerebrals, 
n  before  dentals,  and 
m  before  labials. 


5.  Exception:  Tippi  is  transliterated  by: 


h  before  gutturals, 
n  before  palatals, 
n  before  cerebrals, 
n  before  dentals,  and 
m  before  labials. 


6.  Exception:  When  adhik  implies  the  combination  of  a  non-aspirated  and  an  aspirated 
consonant,  the  combination  is  transliterated  as  a  non-aspirated,  followed  by  an  aspirated 


consonant. 


CHARACTER  MODIFIERS  IN  ROMANIZATION 


Character  modifiers  Name 


USMARC  hexadecimal  code 


S 

□ 


tilde 

macron 

breve 


E4 

E5 

E6 

E7 

F2 

F6 


□ 


□ 

□ 


dot  above 
dot  below 
underscore 
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Letters  of  the  Alphabet 


Initial 

Medial 

Final 

Alone 

Romanization 

1 

L 

L 

1 

omit  (see  Note 

J 

-J* 

v>- 

C-J 

b 

mJ 

V 

V— J 

P 

J 

O. 

Ci 

t 

J 

Cli 

s 

2? 

zy- 

C- 

e 

j 

2H- 

®- 

s 

ch 

C* 

c 

h 

A. 

6 

c 

kh 

d 

i 

:L 

i 

i 

z 

V 

V 

r 

J 

Jr 

3- 

3 

z 

J 

Jr 

3~ 

3 

zh 

-«ui» 

o**- 

s 

JjJ 

■oUita 

v-A- 

sh 

o=>* 

u*1 

s 

-J* 

-*=»- 

iA- 

u*3 

z 

L 

J^ 

-L. 

Jo 

t 

h 

Ji. 

Jo 

z 

s. 

A 

e- 

e 

‘  (ayn) 

X 

ic 

e- 

e 

gh 

J 

jL 

t_0. 

*_s 

f 

ji 

iL 

J- 

0» 

q 

viL 

k  (see  Note  2) 

3 

i. 

g  (see  Note  3) 

J 

i 

J- 

J 

1 

>0 

A. 

r- 

p 

m 

J 

-i- 

O- 

O 

n 

3~ 

3- 

J 

v  (see  Note  3) 

a> 

■6 

<x  <  <x 

o  ,  6 

h  (see  Note  4) 

u 

(_T~ 

»3 

y  (see  Note  3) 
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Vowels  and  Diphthongs  (see  Note  5) 


□ 

a 

T<  IS  a 
(see  Note  6) 

v^g 

T 

l 

9 

□ 

u 

a 

(see  Note  7) 

O  -*■ 

jn 

aw 

□ 

•*>* 

i 

3U  u 

o 

*J0 

ay 

Notes 

1.  For  the  use  of  I  ( alif)  to  support  *  (hamzah)  and  □  ( maddah )  see  rule  1(a).  For  the 

romanization  of  *  and  □  see  rules  4  and  5  respectively.  For  the  use  of  I  to 
represent  the  long  vowel  romanized  a  see  the  table  of  vowels  and  diphthongs,  and 
rule  1(b). 

2.  Final  ^  and  ^  (often  written  ^  and  ^)  may  have  the  form  without  the  distinguishing 

upper  stroke  or  strokes.  The  two  letters  are  always  distinguished  in  romanization. 

3.  For  other  values  of  3  and  ^  see  the  table  of  vowels  and  diphthongs,  and  rules  2,  3,  and 

7. 

4.  &  (dotted  °)  when  used  as  an  alternative  to  ^  is  romanized  t. 

5.  Vowel  points  are  not  printed  on  Library  of  Congress  cards. 

6.  See  rules  1(b)  and  5. 

7.  See  rule  3(d). 


RULES  OF  APPLICATION 


Letters  Which  May  Be  Romanized  in  Different  Ways  Depending  on  Their  Context 


1.  I  ( alif)  is  used: 

(a)  As  a  support  for  *  ( hamzah )  and  □  (maddah).  In  these  cases  it  is  not  represented  in 
romanization.  See  rules  4  and  5. 

(b)  To  indicate  the  long  vowel  romanized  a.  For  the  use  of  I  in  tanvrn  see  rule  6. 

dana  Lib 

2.  3  is  used  to  represent: 

(a)  The  consonant  romanized  v. 

varzish  3^3 

dava 

sarv 
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Silent  3  following  is  retained  in  romanization. 

khvastan  0^1 

khvud 

(b)  The  long  w -vowel  (and  short  w-vowel  in  some  monosyllables)  is  romanized  u. 

dur  ^3^ 

chun  Oy? 

tu 


(c)  The  diphthong  romanized  aw. 

Fir  daws! 


When  the  diphthong  precedes  a  consonantal  3,  the  combination  is  romanized  aw.  See 
rule  7. 

3  may  be  used  as  a  support  for  *  ( hamzah );  in  this  case  it  is  not  represented  in 
romanization.  See  rule  4. 


3. 


*3  is  used  to  represent: 


(a)  The  consonant  romanized  y. 
yar 
siyah 
pay 


(b)  The  long  vowel  romanized  1. 

Tran 

qalf 


uL*l 


(c)  The  diphthong  romanized  ay. 

ayvan 

ray 


*3^ 


(d)  The  final  long  vowel  romanized  d. 

Mustafa 


JlL. 


For  the  use  of  as  a  mark  of  izafah  see  rule  8(c). 


in  the  medial  forms  without  dots,  may  be  used  as  a  support  for  *  (hamzah);  in  this 
case  is  not  represented  in  romanization.  See  rule  4  below. 
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Orthographic  Symbols  Other  Than  Letters  and  Vowel  Signs 

The  signs  listed  below  are  frequently  omitted  in  Persian  writting  and  printing;  their  presence  must 
then  be  inferred.  They  are  represented  in  romanization  according  to  the  following  rules: 

4.  *  ( hamzah ) 


(a)  When  initial,  *  is  not  represented  in  romanization. 


(b)  When  medial  or  final,  *  is  romanized  ’  (alif)  except  as  noted  in  (c)  and  (d)  below, 
mu’assir 

khulafa  ’  *  Lik* 

pa’m 


(c)  When  used  as  a  mark  of  izafah ,  <=  is  romanized  -7. 

astanah-’i  dar  ^ 

(d)  When  used  to  mark  the  indefinite  article,  *  is  romanized  7. 

khanah’i 

□  (maddah) 


(a)  Initial  T  is  romanized  a 
ab 

Kulliyat  al-Adab 


A^\ 


(b)  Medial  »,  when  it  represents  the  phonetic  combination  ’a,  is  so  romanized. 

ma’asir 

Darya’abadf 

a/ 

(c)  □  is  otherwise  not  represented  in  romanization. 

gardavarandah  T  i J*> 

Tanvrn  (written  □,  § ,  □,  Iq  ),  which  occurs  chiefly  in  Arabic  words,  is  romanized  un,  in,  an,  and 
an,  respectively. 


7.  □  ( shaddah  or  tashdid )  is  represented  by  doubling  the  letter  or  digraph  concerned. 


khurram 

awal 

bachchah 

Khayyam 


P> 

Jjl 
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Note  the  exceptional  case  where  B  is  written  over  3  and  3  to  represent  the  combination  of  long 
vowel  plus  consonant. 

nashriyat 

quvah 


Grammatical  Structure  as  It  Affects  Romanization 

8. 


Izafah.  When  two  words  are  associated  in  the  relation  known  as  izafah,  the  first  (the  muzaf) 
is  followed  by  an  additional  letter  or  syllable  in  romanization.  This  is  added  according  to  the 
following  rules: 


(a) 


When  the  muzaf  bears  no  special  mark  of  izafah,  it  is  followed  by  -i. 
dar-i  bagh  £)-t 

qalf-i  Iran  0U1I  ^ 

khanah-i  buzurg  &  .  -s  ^1^ 


(b)  When  the  muzaf  is  marked  by  the  addition  of  <=,  it  is  followed  by  -7. 

qalF-’i  Fran  uLrfl 

khanah-’i  buzurg  ^  .  >j  «\_iU 


(c) 


When  the  muzaf  is  marked  by  the  addition  of  <3,  it  is  followed  by  -yi. 
ru-yi  zamih  33^ 

Darya-yi  Khazar  jjr 

khanah-yi  buzurg  *3^^ 


(d)  Izafah  is  represented  in  romanization  of  personal  names  only  when  expressly  indicated  in 
the  Persian  script. 


Affixes  and  Compounds 

9. 


Affixes. 

(a)  When  the  affix  and  the  word  with  which  it  is  connected  grammatically  are  written 
separately  in  Persian,  the  two  are  separated  in  romanization  by  a  single  prime  (')•  See 


but 


but 


khanah'ha 

khanah'am 

khanah/f 

mP  ravam 

mfravam 

?3jrP> 

bih'gu 

3$<^ 

bar'raslha 

Kazan' zadah 

0^1 j  f>Lu 

Kazimzadah 
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(b)  The  Arabic  article  al  is  separated  by  a  hyphen,  in  romanization,  from  the  word  to  which 
it  is  prefixed. 


10. 


dar  al-mu‘allimm 

li=r  A  11 

‘Abd  al-Husayn 

‘  h 

Compounds.  When  the  elements  of  a  compound  (except  a  compound  personal  name)  are 
written  separately  in  Persian,  they  are  separated  in  romanization  by  a  single  prime  (/).  See  also 

12(b)  below. 

marfz'khanah 

but  marfzkhanah 

Shah'namah 

<XjoLi  oLii 

but  Shahnamah 

4joL|£)Lu 

Note  the  treatment  of  compound  personal 

names: 

Ghulam  ‘Alf 

or 

Shah  Jahan 

or 

Ibn  AbF  Talib 

<_J  1 1 — >  ^  j  1  i  or  ^es  ^  1  O-y  1 

Orthography  of  Persian  in  Romanization 


11.  Capitalization. 


(a)  Rules  for  the  capitalization  of  English  are  followed,  except  that  the  Arabic  article  al  is 
lower  cased  in  all  positions. 

(b)  Diacritics  are  used  with  both  capital  and  lower  case  letters. 


12.  The  single  prime  (')  is  used: 


(a)  To  separate  two  letters  representing  two  distinct  consonantal  sounds,  when  the 
combination  might  otherwise  be  read  as  a  digraph. 
marz/ha 


(b) 


To  mark  the  use  of  a  letter  in  its  final  form  when  it  occurs  in  the  middle  of  a  word.  See 
also  rules  9(a)  and  10  above. 


rah '  ha 

Qayim/maqamf 

Bih'azm 


La>olv 

, ,o LsLa  pLjla 
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13.  Foreign  words  in  a  Persian  context,  including  Arabic  words,  are  romanized  according  to  the  rules 
for  Persian.  For  short  vowels  not  indicated  in  the  script,  the  Persian  vowels  nearest  the  original 
pronunciation  of  the  word  are  supplied  in  romanization. 

14.  Dictionaries. 

In  romanizing  Persian,  the  Library  of  Congress  has  found  it  necessary  to  consult  dictionaries  as 
an  appendage  to  the  romanization  tables,  primarily  for  the  purpose  of  supplying  vowels.  For 
Persian,  the  principle  dictionary  consulted  is: 

M.  MuTn.  Farhang-i  FarsT-i  mutavassit. 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

/ 

soft  sign  (prime) 

A7 

9 

alif 

AE 

( 

ayn 

BO 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

macron 

E5 

□ 

dot  below 

F2 

0 

double  dot  below 

F3 

□ 

underscore 

F6 
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Pushto 

Letters  of  the  Alphabet 


Initial 

Medial 

Final 

Alone 

Romanization 

1 

L 

L 

1 

omit  (see  Note 

-J 

•J*. 

1-*. 

LJ 

b 

mJ 

*- 

V- 

CJ 

P 

J 

CL 

t 

I 

Tf“ 

V- 

V 

t 

J 

CL 

s 

£* 

e- 

c 

j 

J£k. 

®- 

s 

ch 

& 

c- 

c 

h 

• 

2* 

i*- 

c 

• 

s 

* 

*• 

e- 

c 

z 

6 

c 

kh 

x 

X 

d 

* 

J 

d 

i 

X 

X 

i 

z 

V 

V 

*r 

V 

r 

-r 

-T 

X 

r 

J 

Jr 

J” 

J 

z 

0 

0“ 

0" 

0 

zh 

-X 

XT 

V- 

zh 

mAJj* 

o**- 

s 

-ui 

•JjJm 

v_p» 

0“ 

sh 

Xu 

^Jdm 

v-4*- 

<->r 

sh 

s 

iO3- 

z 

L 

L. 

Jo. 

Jo 

t 

L 

Jx 

Jx 

Jo 

z 

X 

A 

e- 

6 

‘  (ayn) 

X 

JEL 

e- 

e 

gh 

j 

Jl 

t-X 

i_a 

f 

i 

JL 

V>- 

0 

q 

v*i. 

ej 

k 

3 

g 

J 

i. 

g 

J 

1 

J- 

J 

1 

JO 

A. 

r- 

t 

m 

J 

-L 

O- 

o 

n 

* 

v- 

V 

n 
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Notes 


-r* 

-r*" 

-r*“ 

-r1 

n 

3 

3- 

3- 

3 

w  (see  Note  2) 

a> 

■6 

A.  <  A. 

0  <  '6 

h 

u 

tt- 

S?  ‘  v5 

y  (see  Note  3) 

and  Diphthongs 

(see  Note  4) 

Romanization 

Initial 

Medial 

Final 

a 

(see  Note  5) 

1 

J? 

□ 

□ 

u 

1 

□ 

□ 

i 

1 

T 

□ 

□ 

a 

6 

1  □ 

a 

- 

(see  Note  6)  ,3  □ 

u 

3  * 

3U 

3  □ 

«— 

l 

1 

*•  *•» 

**  ✓ 

^9  ‘^9 

o 

3° 

3  □ 

e 

J  .  -  1 

-t-o  t 

£_□  <  c  ^$0  < 

aw 

V 

j>6 

30 

ay 

IT 

0  ^ 

-e-n 

0  **  0 

kjsd  <  csn 

ay 

1. 

For  the  use  of  1  ( alif)  see 

the  table  of  vowels  and  diphthongs,  and  rules  1-4. 

2. 

For  other  uses  of  3  see  the  table  of  vowels  and  diphthongs,  and  rule  5. 

3. 

For  other  forms  and  used  of  see  the  table  of  vowels  and  diphthongs,  and  rules 

6. 

7. 


Vowel  points  are  rarely  found  in  Pushto  writing  and  printing.  Vowels  not  indicated  in  the 
script  are  supplied  in  romanization. 

In  addition  to  the  three  short  vowels  recognized  by  the  orthography  of  all  languages  using 
Arabic  script,  Pushto  possesses  a  short,  central  vowel  for  which  the  sign  □ 
(differentiated  from  □  )  has  sometimes  been  used.  Both  signs,  whether  written  or 
inferred,  may  be  romanized  a.  When  this  central  vowel  is  indicated  by  *  ( hamzah ), 
the  vowel  is  romanized  a\  the  hamzah  in  this  case  is  not  represented  in 
romanization.  See  rule  8(c)  and  (d).  When  it  is  desired  for  any  reason  to  show  the 
presence  of  a  central  vowel  in  a  particular  word,  the  romanization  a  may  be  used. 

See  rule  7(a). 

See  rule  7(b). 
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RULES  OF  APPLICATION 


Letters  Which  May  Be  Romanized  in  Different  Ways  Depending  on  Their  Context 


I  ( alif)  is  used: 

(a)  To  indicate  the  presence  of  any  of  the  short  vowels  a,  u,  L 
atah  ^ 

ulas 

>Lsej| 


in  am 


Pl 


(b)  To  represent  the  long  vowel  romanized  a.  When  so  used,  I  is  usually  written  with  □ 
( maddah ). 


adam 


P 


(c)  To  represent,  in  combination  with  3  or  : 


(1)  The  long  vowels  romanized  u  and  T  respectively, 
ush 

Iran  oLh' 


(2)  The  long  vowels  romanized  o  and  e  respectively. 


os 

eshal 


(3)  The  diphthongs  romanized  aw  and  ay  respectively. 

awwal  Jjl 

ayyam  j>Ld 


2. 


(alif),  when  medial  or  final,  is  used  to  represent  the  long  vowel  romanized  a. 
baba 


3.  I  (alif),  when  final,  sometimes  represents  the  combination  romanized  an.  This  value  is  confined 
to  a  few  words  of  Arabic  origin.  See  rule  10. 

4.  I  (alif),  when  used  to  support  *  (hamzah),  is  not  represented  in  romanization;  see  rule  8. 

5.  3  is  used  to  represent: 

(a)  The  consonant  romanized  w. 

wror  vJvJ 


In  Persian  words,  silent  3  following  «  is  retained  in  romanization. 
khwushhal 

(b)  The  long  vowel  romanized  u.  See  also  rule  l(cl). 

num 

(c)  The  long  vowel  romanized  o.  See  also  rule  l(c2). 

kor  _j3^ 
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(d)  The  diphthong  romanized  aw.  See  also  rule  l(c3). 

yaw  31 

(e)  The  short  vowel  romanized  u.  Such  cases  are  rare  in  Pushto  orthography. 

u 

Note:  3  when  used  to  support  *  ( hamzah )  is  not  represented  in  romanization.  See  rule  8. 

6.  is  used  to  represent: 

(a)  The  consonant  romanized  y. 

yaw  3n 

(b)  The  long  vowel  romanized  i.  See  also  rule  l(cl) 

mrnah 

(a)  The  long  vowel  romanized  e.  See  also  rule  l(c2). 

der  ^  ^ 

dre  C— ^  (  ^3 ^ ^ 

(c)  The  diphthong  romanized  ay.  See  also  rule  l(c3). 

shay 

Note:  Medial  when  used  to  support  *  (hamzah)  is  usually  written  without  dots  <  ■$•); 
it  is  not  represented  in  romanization.  See  also  rule  8(b)  and  (c) 

7.  when  final  has  two  special  uses.  It  may  represent: 

(a)  The  long  vowel  romanized  d.  This  use  is  confined  to  word  of  Arabic  origin. 

Mustafa  5-iL^uo 

•  •• 

(b)  The  diphthong  romanized  ay.  When  used  with  this  value,  v5  usually  has  the  form  in 
Peshawar,  in  Afghanistan.  See  also  rule  8(c). 

doday  <  ^3* 


Orthographic  Symbols  Other  Than  Letters  and  Vowel  Signs 

The  signs  listed  below  are  often  omitted  in  Pushto  writing  and  printing;  their  presence  must  be 
inferred.  They  are  represented  in  romanization  according  to  the  following  rules: 

8.  t  (hamzah) 

(a)  In  initial  position,  <=  is  not  represented  in  romanization. 

(b)  In  words  of  Arabic  origin,  <=  when  medial  or  final  is  romanized  ’  (alif). 

su’al  Jlj-^ 

da’imF 

mas  ’  alah  aJ  L^o 
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(c)  When  written  over  final  v5  or  c_,  *  indicates  the  diphthong  romanized  ay.  See  rule  7(b). 

ghwaray 

(d)  In  the  orthography  of  Peshawar,  *>  when  written  over  final  °  indicates  the  short  vowel 
romanized  a.  See  the  table  of  vowels  and  diphthongs,  footnote  7. 

nah  ^ 


9.  □  ( maddah )  is  written  over  °  ( alif)  to  indicate  the  long  vowel  romanized  a. 

akhir  ^ 


10.  Tanwin  (written  □  )  occurs  in  words  of  Arabic  origin  and  represents  a  final  syllable  which  is 
romanized  an.  Tanwin  is  usually  associate  with  I  (alif),  which  is  not  represented  in  romanization. 
See  rule  3. 

fitratan 

11.  □  (tashdud)  is  represented  by  doubling  the  letter  or  digraph  concerned. 

awwal  Jjl 


Grammatical  Structure  as  It  Affects  Romanization 

12.  Izafah.  When  two  words  are  associated  in  the  Persian  construction  known  as  izafah,  the  first 
(the  muzaf)  is  followed  by  an  additional  letter  or  syllable  in  romanization.  This  is  added 
according  to  the  following  rules: 

(a)  When  the  muzaf  bears  no  special  mark  of  izafah,  it  is  followed  by  -i. 

dars-i  ‘ibrat  ^ ^ 

sahib-i  mubarak  ^ 

(b)  When  the  muzaf  is  marked  by  the  addition  of  <=  (hamzah),  it  is  followed  by  -’i. 

(c)  When  the  muzaf  is  marked  by  the  addition  of  ^s,  it  is  followed  by  -yi. 

Affixes  and  Compounds 

13.  Affixes. 

When  the  affix  and  the  word  with  which  it  is  connected  grammatically  are  written  separately  in 
Pushto,  the  two  are  separated  in  romanization  by  a  single  prime  (/). 

na'ashna  Li-ilL 

war 1  astawf  I ^3 

Note  1:  The  particle  ^  (da),  meaning  "of,  is  always  written  as  a  separate  word  in  romanization. 
da  dah  * 

Note  2:  The  Arabic  article  al  is  separated  from  what  follows  by  a  hyphen.  The  /  of  the  article 
is  retained  in  romanization,  regardless  of  whether  or  not  this  /  is  assimilated  in  pronunciation  to 
the  sound  of  the  following  letter. 

‘Abd  al-Rashld  ^ 

Fazl  al-Haqq 

Nizam  al-Dfn  pU^ 
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14.  Compounds. 

The  elements  of  a  compound  (except  a  compound  personal  name)  are  separated  in  romanization 
by  a  single  prime  (/). 

kitab'khanah 
but  manzkhanah 

Orthography  of  Pushto  in  Romanization 

15.  Capitalization. 

(a)  Rules  for  the  capitalization  of  English  are  followed,  except  that  the  Arabic  article  al  is 
lower  cased  in  all  positions. 

(b)  Diacritics  are  used  with  both  capital  and  lower  case  letters. 

16.  Foreign  words  in  a  Pushto  context,  including  Arabic,  Persian,  and  Urdu  words,  are  romanized 
according  to  the  rules  for  Pushto.  For  vowels  not  indicated  in  the  script,  the  Pushto  vowels 
nearest  those  in  the  original  pronunciation  of  the  word  are  supplied  in  romanization. 

Yunlwarsitf  Buk  Ejansf 
( not  University  Book  Agency) 

Darmastatar  v  I  i 

{not  Darmesteter) 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

/ 

soft  sign  (prime) 

A7 

5 

alif 

AE 

C 

ayn 

BO 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

0 

macron 

E5 

□ 

dot  above 

E7 

□ 

dot  below 

F2 

o 

double  dot  below 

F3 

□ 

underscore 

F6 
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Russian 


Vernacular 

Upper  case  letters 

A 

B 

B 

r 

n 

E 

E 

>K 

3 
M 

I  (see  Note  1) 

M 

K 

n 

M 

H 

o 

n 

p 

c 

T 

y 

$ 

x 

ii 

4 

in 

ni 

Ta  (see  Note  2) 

bl 

b  (see  Note  1) 
13  (see  Note  3) 
3 
K) 

n 

A  (see  Note  4) 
0  (see  Note  3) 
V  (see  Note  3) 


Romanization 


A 

B 

V 
G 
D 
E 
E 
Zh 
Z 

I 

I 

I 

K 

L 

M 

N 

O 

P 

R 

S 

T 

U 

F 

Kh 

fS 

Ch 

Sh 

Shch 

"  (hard  sign) 

V 

'  (soft  sign) 

ffi 

£ 

m 

TA 

£ 

F 

V 


Vernacular 

Lower  case  letters 

a 

6 

B 

r 

a 

e 

e 

x 

3 

M 

i  (see  Note  1) 

ii 
K 

n 

M 

H 

O 

n 

P 

c 

T 

y 

X 

U 

H 

Hi 

LU 

■b  (see  Note  2) 

bi 

b  (see  Note  1) 
t.  (see  Note  3) 

3 

K) 

a 

a  (see  Note  4) 
e  (see  Note  3) 
v  (see  Note  3) 


Romanization 


a 

b 

v 

g 

d 

e 

e 

zh 

z 

i 

T 

1 

V 

1 

k 

1 

m 

n 

o 

P 

r 

s 

t 

u 

f 

kh 

/ — \ 
ts 

ch 

sh 

shch 

"  (hard  sign) 

y 

'  (soft  sign) 

o 

le 

e 

<-'• 

1U 
< — ^ 
la 

f 

y 
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Notes 

1.  Do  not  confuse  with  similar  part  of  the  letter  bl,  bi  (romanized  Y,  y). 

2.  Letter  is  disregarded  in  romanization  when  found  at  the  end  of  a  word. 

3.  Letter  is  considered  obsolete  for  the  modern  Russian  Cyrillic  alphabet;  found  primarily  in 
prerevolutionary  and  emigre  publications. 

4.  Church  Slavic  letter  occasionally  found  in  the  modern  Russian  Cyrillic  alphabet. 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

/ 

soft  sign  (prime) 

A7 

// 

hard  sign  (double  prime) 

B7 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

□ 

macron 

E5 

□ 

breve 

E6 

□ 

dot  above 

E7 

n 

Li 

umlaut  (diearesis) 

E8 

G 

ligature,  1st  half 

EB 

0 

ligature,  2nd  half 

EC 

right  hook 

FI 
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Sanskrit  and  Prakrit 

(in  Devanagari  Script) 


When  Sanskrit  is  written  in  another  script,  the  corresponding  letters  in  that  script  are  transliterated 
according  to  this  table. 

Vowels  and  Diphthongs  (see  Note  1) 


3T 

3TT 

% 

i 

& 


a 

a 

i 

TT 

1 

U 

U 

r 


Consonants  (see  Note  2) 


3T 

c 

aft 

3ft 


r 

o 

1 

o 

e 

ai 

o 

au 


Gutturals 


Palatals 


Cerebrals 


Dentals 


fr 

ka 

ca 

^  ta 

5T  ta 

W 

kha 

cha 

3  tha 

st  tha 

*T 

ga 

ja 

¥  da 

da 

gha 

jha 

s  dha 

sr  dha 

¥ 

na 

3T 

na 

^  na 

^  na 

Labials 

Semivowels 

Sibilants 

Aspirate 

pa 

ya 

3T  sa 

^  ha 

pha 

X 

ra 

q-  sa 

ba 

la 

^  sa 

bha 

55 

la 

ma 

¥ 

va 

Anusvara 

Anunasika 

Visarga 

Jihvamuliya 

(see  Note  3) 

6 

m 

8 

MJ 

m 

:  h 

)( 

Upadhmaniya 

^  h 


Avagraha 
(see  Note  4) 

s  »  (apostrophe) 
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Notes 

1.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed;  the  forms  used  for 
vowels  following  a  consonant  can  be  found  in  grammars;  no  distinction  between  the  two  is 
made  in  transliteration. 

2.  The  vowel  a  is  implicit  after  all  consonants  and  consonant  clusters  and  is  supplied  in 
transliteration,  with  the  following  exceptions: 

(a)  when  another  vowel  is  indicated  by  its  appropriate  sign;  and 

(b)  when  the  absence  of  any  vowel  is  indicated  by  the  subscript  sign  (  n  )  called  halanta  or 
virama. 

3.  Exception:  Anusvara  is  transliterated  by: 

h  before  gutturals, 
ii  before  palatals, 
n  before  cerebrals, 
n  before  dentals,  and 
m  before  labials. 

4.  When  doubled,  avagraha  is  transliterated  by  two  apostrophes  (  ”  ). 

SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character 

Name 

USMARC  hexadecimal  code 

y 

apostrophe 

27 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

tilde 

E4 

□ 

macron 

E5 

□ 

dot  above 

E7 

fi 

candrabindu 

EF 

□ 

dot  below 

F2 

0 

circle  below 

F4 

□ 

underscore 

F6 

0 

upadhmaniya 

F9 
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Santali 

(in  01  Script) 


Vowels 


2) 

a 

b 

u 

a 

a 

2 

e 

». 

a 

• 

3 

o 

7t 

i 

Consonants 

b 

k 

0 

P 

G 

g 

0) 

b 

$ 

ft 

a 

m 

6 

c 

a 

y 

0 

j 

$ 

r 

Q 

n 

p 

1 

T) 

t 

y 

w 

(J) 

d 

v: 

s 

2 

r 

0 

h 

0 

n 

© 

h 

0 

t 

9 

’  (apostrophe) 

© 

d 

(see  Note  1) 

z 

n 

3 

m 

n 

m  (see  Note  2) 

Notes 

1.  The  letter  9  is  used  with  "checked"  (voiceless)  consonants  and  indicates  a  change  in 
pronunciation.  It  is  written  following  four  consonants  only,  the  letters  G,  U,  ©,  (1), 
which  are  romanized  g,  j,  d,  and  b  respectively. 

2.  The  mark  □  (a  dot  written  above  the  line  after  a  vowel)  is  used  to  indicate  nasalization  and 
is  romanized  as  m  after  the  vowel  it  modifies.  Another  mark,  □  (written  below  the  line  after 
the  vowel  it  modifies)  is  used  to  indicate  low  pitch  and  is  omitted  in  romanization. 
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Other  Marks  Used  in  Ol  Script 

1.  The  mark  □  ,  written  over  a  letter,  is  used  to  indicate  the  prolongation  of  a  vowel.  It  is 
omitted  in  romanization. 

2.  The  mark  -  (similar  to  a  hyphen)  is  used  to  separate  "checked"  consonants  from  other 
consonants  or  vowels.  It  is  omitted  in  romanization. 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  character 

Name 

USMARC  hexadecimal  code 

> 

apostrophe 

27 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

□ 

tilde 

E4 

□ 

macron 

E5 

□ 

dot  above 

E7 

a 

candrabindu 

EF 

□ 

dot  below 

F2 

□ 

underscore 

F6 
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Serbian  and  Macedonian 


The  romanization  in  the  table  below  insures  correspondence  between  the  Serbian  and  present-day 
Croation  alphabets. 


Vernacular 


Romanization 


Vernacular 


Romanization 


Upper  case  letters 


Lower  case  letters 


A 

B 

B 

r 

a 

r 

a 

T) 

E 

3 

S 

M 

J 

K 

* 

K 

n 

jb 

M 

H 

H> 

O 

n 

p 

c 

T 

■R 

y 

$ 

x 

n 

H 

y 

in 


A 

B 

v 

G 

6 

D 

D 

E 

1 

Z 

Dz 

I 

J 

K 

£ 

L 

Lj 

M 

N 

Nj 

O 

P 

R 

5 
T 

6 
u 

F 

H 

C 

e 

Dz 


a 

6 

B 

r 

* 

r 

a 

b 

e 

x 

3 

s 

M 

j 

K 

✓ 

K 

n 

M 

H 

H> 

O 

n 

p 

c 

T 

h 

y 

X 

u 

M 

d 

III 


a 

b 

v 

g 

* 

g 

d 

d 

e 

z 

z 

dz 

i 

j 

k 

k 

1 

lj 

m 

n 

nj 

o 

P 

r 

s 

t 

a 

c 

u 

f 

h 

c 

V 

c 

dz 

V 

s 
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SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

D 

D  with  crossbar 

A3 

(upper  case) 

d 

d  with  crossbar 

B3 

(lower  case) 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

a 

hachek 

E9 
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Sindhi 

(in  Arabic  Script) 


Letters  of  the  Alphabet  (see  Note  1) 


Initial 

Medial 

Final 

Alone 

Romanization 

1 

L 

L 

1 

omit 

J 

e-*» 

C-J 

b 

mJ 

b 

mi 

V—A. 

<*— J 

bh 

li 

LJ 

t 

mt 

cx. 

o 

th 

•1 

«  • 

X. 

•  1 
iJli 

t 

J 

Cib 

C-i 

th 

-i- 

vU 

s 

mi 

r*- 

v 

UJ 

P 

3? 

p». 

£- 

£ 

j 

er- 

£ 

V 

J 

4=r 

4^ 

4^ 

4^ 

jh 

- 

•3<t_ 

S 

n 

£- 

S 

c 

SHmm 

r 

£ 

ch 

9%. 

c 

h 

9- 

9%. 

c 

kh 

X. 

x. 

d 

S 

l 

L 

s 

dh 

$ 

L 

L 

d 

x. 

x. 

d 

x 

x. 

dh 

i 

L 

L 

i 

z 

V 

V 

j- 

r 

- 

Jr 

J- 

V 

r 

J 

J~ 

J" 

J 

z 

^aJ 

mJjJm 

VX**- 

s 

mJjJm 

sh 

O3 

s 

C*3- 

u*3 

z 

L 

L. 

JL. 

io 

t 

L 

-L. 

.L. 

ii 

z 

X 

jai 

6 

‘  (ayn) 

X 

X 

e 

gh 

i 

.a. 

i_L 

f 

Jo 

Jfc- 

<JL 

<Jo 

ph 

ii 

iL 

<3 

q 
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S  s 

k 

kh 

3 

g 

3 

: 

4-  4- 

$ 

g 

4^-  4^- 

4* 

gh 

- 

& 

3 

ft 

J 

1  J- 

J 

1 

JO 

a. 

P 

m 

mJ 

-*■  O- 

O 

n 

u 

•J 

j-  j- 

u 

n 

J 

> 

3 

V 

S2> 

4  -  <  o  <  o  < 

h 

- 

- 

9 

omit 

-j 

■rf*  S?" 

y 

Vowels  and  Diphthongs  (see  Note  2) 

□ 

a 

r 

l 

u 

o 

□ 

i 

CrD  ‘ 

On  e 

Isn  *  v30  ‘ 

a 

au 

* 

3D 

u 

° 

on  ai 

Notes 

1.  When  Sindhi  is  written  in  the  Devanagari  script,  it  is  romanized  according  to  the  Hindi 

table.  The  letters,  ?  ,  ?  ,  5  ,  and 

n  are  romanized  b,  ;,  d,  and  g  respectively. 

2.  The  rules  of  application  of  Urdu  are 

to  be  used  for  Sindhi.  Vocalization  are  those  used 

for  entry  in  Mewaram’s  A  Sindhi  English  Dictionary  (1st  ed.,  Hyderabad,  1910),  Sindhi 
Urdu  Lughat  (Haidarabad,  1959),  and  Jami'-T Sindhi  Lughat  (KaracF,  1931). 

SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 

Special  Characters 

Name 

USMARC  hexadecimal  code 

< 

ayn 

BO 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

a 

tilde 

E4 

0 

macron 

E5 

□ 

dot  above 

El 

•• 

□ 

umlaut  (dieresis) 

E 8 

□ 

6 

dot  below 

F2 

double  dot  below 

F3 

□ 

underscore 

F6 
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Sinhalese 


Sinhalese 


Vowels  and  Diphthongs  (see  Note  1) 


a 


d 


a 

© 

o 

<n 

a 

© 

6 

<n 

A 

a 

65a 

r 

o 

i 

£3aa 

r 

o 

3 

"T* 

1 

«S> 

1 

o 

c 

U 

T 

0 

c* 

u 

ai 

e> 

e 

©« 

au 

nts 

(see  Note  2) 

Gutturals 

Palatals 

Cerebrals 

Dentals 

2S> 

ka 

0 

ca 

G 

ta 

»  ta 

3 

kha 

6 

cha 

dj 

tha 

• 

^  tha 

CD 

ga 

6 

ja 

Q 

da 

5  da 

65 

gha 

25© 

jha 

£ 

dha 

^  dh; 

© 

ha 

ha 

na 

232  na 

Labials 

Semivowels 

Sibilants 

Aspirate 

£5 

pa 

05 

ya 

CD 

sa 

to  ha 

£ 

pha 

3 

ra 

85 

sa 

3 

ba 

e 

la 

63 

sa 

eo 

bha 

€ 

la 

© 

ma 

0 

va 

Anusvara 
(see  Note  3) 


Visarga 


Avagraha 


Sahhaka 
(see  Note  4) 


m 


(apostrophe) 
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Notes 

1.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed;  the  forms  used  for 
vowels  following  a  consonant  can  be  found  in  grammars;  no  distinction  between  the  two  is 
made  in  transliteration. 

2.  The  vowel  a  is  implicit  after  all  consonants  and  consonant  clusters  and  is  supplied  in 
romanization,  with  the  following  exceptions: 

(a)  when  another  vowel  is  indicated  by  its  appropriate  sign;  and 

(b)  when  the  absence  of  any  vowel  is  indicated  by  the  sign  V  (or  ~ )  called  virama. 

3.  Exception:  Anusvara  is  transliterated  by: 

h  before  gutturals, 
h  before  palatals, 
n  before  cerebrals, 
n  before  dentals,  and 
m  before  labials. 

4.  Exceptions: 

(a)  when  sahhaka  represents  a  nasal.  It  is  romanized  according  to  the  rule  for  anusvara. 

(b)  when  sahhaka  is  combined  with  an  aspirated  consonant,  the  combination  is  romanized 
as  a  non-aspirated,  followed  by  an  aspirated  consonant. 


SPECIAL  CHARACTERS  AND 

l  CHARACTER  MODIFIERS  IN  ROMANIZATION 

Special  character 

Name 

USMARC  hexadecimal  code 

y 

apostrophe 

27 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

circumflex 

E3 

□ 

tilde 

E4 

□ 

macron 

E5 

□ 

breve 

E6 

□ 

dot  above 

E7 

□ 

dot  below 

F2 

Q 

circle  below 

F4 
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Tamil 


Vowels  and  Diphthongs  (see  Note  1) 


s\  a 

a 

§  i 

FT-  I 

2_  U 

g>STT  U 


Consonants  (see  Note  2) 


• 

•  • 

ka 

a 

ka 

rw 

ha 

9 

ca 

© 

ha 

L_ 

ta 

6337 

na 

s> 

ta 

fb 

na 

U 

pa 

Sanskrit  Sounds 

8  ja 

sa 

®P  sa 


6T 

67 

ffi 

9 

9 

Q6I7 


e 

e 

ai 

o 

5 

au 


ID 

UJ 

(7 

60 

60 

\S> 

6TT 

P 

637 


ma 

ya 

ra  (see  Note  3) 

la 

va 

la 

la 

ra 

na 


60) 

PD 


sa 

ha 


Notes 

1.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed;  the  forms  used  for 
vowels  following  a  consonant  can  be  found  in  grammars;  no  distinction  between  the  two  is 
made  in  transliteration. 

2.  The  vowel  a  is  implicit  after  all  consonants  and  consonant  clusters  and  is  supplied  in 
romanization,  with  the  following  exceptions: 

(a)  when  another  vowel  is  indicated  by  its  appropriate  sign;  and 

(b)  when  the  absence  of  any  vowel  is  indicated  by  the  superscript  dot  (  □  )  called  pulli. 

3.  This  letter  has  the  same  form  as  the  vowel  sign  for  a  appearing  after  a  consonant.  Where 
ambiguity  arises,  it  is  written  $  . 
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CHARACTER  MODIFIERS  IN  ROMANIZATION 


Character  Modifiers 

Name 

USMARC  hexadecimal  code 

s 

□ 

acute 

E2 

0 

tilde 

E4 

□ 

macron 

E5 

□ 

dot  above 

E7 

□ 

dot  below 

F2 

□ 

underscore 

F6 
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Telugu 


Vowels  and  Diphthongs  (see  Note  1) 


es 

a 

•9 

l 

o 

«5 

a 

e 

a 

i 

& 

e 

•&€ 

•r 

i 

S3 

ai 

u 

a. 

o 

u 

to 

5 

2300 

r 

r 

au 

o 

zx, j° 

r 

o 


Consonants  (see  Note  2) 

Gutturals  Palatals  Cerebrals  Dentals 


sr 

ka 

d5 

ca 

S3 

kha 

■CS 

6a 

ga 

cha 

$0 

gha 

a 

ja 

a 

ha 

— e 

a 

ja 

jha 

a* 

ha 

to 

ta 

• 

<5 

ta 

& 

tha 

• 

tha 

a 

da 

• 

dS 

da 

$ 

dha 

• 

<5 

dha 

£9 

na 

• 

na 

Labials 


Semivowels 


Sibilants 


Aspirate 


& 

pa 

o 3o 

ya 

sa 

Zp 

ha 

£> 

pha 

ra 

& 

sa 

• 

23 

ba 

®9 

ra 

£ 

sa 

£ 

bha 

53 

la 

So 

ma 

la 

ss 

va 

Sunna 

Visarga 

Ardhasunna 

(see  Note 

3) 

(see  Note  4) 

0 

m 

o 

o 

h 

• 

c  ,  fi  m 
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Notes 

1.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed;  the  forms  used  for 
vowels  following  a  consonant  can  be  found  in  grammars;  no  distinction  between  the  two  is 
made  in  transliteration. 

2.  The  vowel  a  is  implicit  after  all  consonants  and  consonant  clusters  and  is  supplied  in 
transliteration,  with  the  following  exceptions: 

(a)  when  another  vowel  is  indicated  by  its  appropriate  sign;  and 

(b)  when  the  absence  of  any  vowel  is  indicated  by  the  superscript  sign  (  £_  )  called 
valapalagilaka. 

3.  Exception:  Sunna  is  transliterated  by: 

h  before  gutturals, 
n  before  palatals, 
n  before  cerebrals, 
n  before  dentals,  and 
m  before  labials. 

4.  Ardhasunna  before  gutturals,  palatal,  cerebral,  and  dental  occlusives  is  transliterated  n. 
Before  labials,  sibilants,  semivowels,  the  aspirate,  vowels,  and  in  final  position  it  is 
transliterated  m. 


CHARACTER  MODIFIERS  IN  ROMANIZATION 


Character  Modifiers 

Name 

USMAh 

6 

acute 

E2 

□ 

circumflex 

E3 

0 

tilde 

E4 

□ 

macron 

E5 

□ 

dot  above 

E7 

a 

candrabindu 

EF 

g 

cedilla 

FO 

□ 

dot  below 

F2 

g 

circle  below 

F4 

□ 

underscore 

F6 
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Thai 


Vowels  and  Diphthongs 

Vernacular 

Romanization 

Vernacular 

Romanization 

as,  a 

a 

ans 

ua 

in 

a 

an,  n 

ua 

an 

am 

la,  la,  ay,  lay 

ai 

a 

i 

any 

ai 

a 

TP 

1 

tan 

ao 

'Sf 

a 

ir 

am 

ao 

a 

u* 

ay 

t 

ui 

a 

» 

u 

lay 

5i 

a 

u 

u 

aay 

Qi 

las,  la 

e 

iay 

oei 

1a 

e 

iaay 

irai 

uas,  ua 

32 

any 

uai 

ua 

as 

an 

iu 

las,  a 

o 

lan 

eo 

la 

5 

lan 

eo 

tfns 

9 

nan 

3SO 

aa 

9 

iayn 

Teo 

laas 

CE 

q 

nr 

laa,  1a 

oe 

q 

ri 

IQIIS 

ia 

q 

roe 

iay 

Ta 

q'l 

rtf 

laas 

tfa 

q 

ltf 

iaa 

\xa 

qn 

lir 
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Consonants 


Vernacular 


Romanization 
when  initial 
and  medial 


n 

v,  nj,  r,  n,  *w 

>3 

ft,  ?r,  *3,  ai 

OJ 

o 

PI,  |3,  (see  Note  1) 

13 

n,  j|j,  VI,  cVi,  B,  ai  (see  Note  1) 
U,  04 
tJ 

vl 

u,  w,  n 
tJ,  vJ 
34 
tl 

3  (see  Note  2) 

a,  vr 

*3,  VIS,  rf,  W, 

0  (see  Note  3) 

VI,  6  (see  Note  4) 


k 

kh 

ch 

ch 

y 

d 

t 

th 

n 

b 

P 

ph 

f 

m 

y 

r 

1 

w 

s 

‘  (ayn) 
h 


Notes 


Romanization 
when  final 


k 

k 

ng 

t 

t 

n 

t 

t 

t 

n 

P 

P 

P 

P 

m 

n 

n 

t 


1.  is  usually  romanized  th,  occasionally  d,  depending  on  the  pronunciation  as  determined  from  an 
authoritative  dictionary. 

2.  When  5  follows  another  consonant  and  ends  a  syllable,  it  is  romanized  n,  and  the  inherent  vowel 
of  the  preceding  consonant  is  represented  by  q  (e.g.,  UR5  nakhqn). 

When  51  follows  another  consonant  and  no  other  pronounced  consonant  follows  in  the  same 
syllable,  it  is  romanized  an  (e.g.,  ffallfl  sawan),  but  if  a  pronounced  consonant  follows,  it  is 
represented  by  a  (e.g.,  fl553J  kam). 
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3.  In  four  common  words  0  occurs  preceding  another  consonant  to  mark  a  certain  tone  and  is  then 
not  romanized. 

4.  When  VI  occurs  preceding  another  consonant  to  mark  a  certain  tone,  it  is  not  romanized. 

RULES  OF  APPLICATION 

Romanization 

1.  Tonal  marks  are  not  romanized. 

2.  The  symbol  °1  indicates  omission  and  is  shown  in  romanization  by  " ... "  the  conventional  sign  for 
ellipsis. 

3.  When  the  repeat  symbol  *1  is  used,  the  syllable  is  repeated  in  romanization. 

4.  The  symbol  is  romanized  la. 

5.  Thai  consonants  are  sometimes  purely  consonantal  and  sometimes  followed  by  an  inherent  vowel 

romanized  o,  a,  or  g  depending  on  the  pronunciation  as  determined  from  an  authoritative 
dictionary,  such  as  the  Royal  Institute’s  latest  edition  (1982). 

6.  Silent  consonants,  with  their  accompanying  vowels,  if  any,  are  not  romanized. 

7.  When  the  pronunciation  requires  one  consonant  to  serve  a  double  function-at  the  end  of  one 
syllable  and  the  beginning  of  the  next--it  is  romanized  twice  according  to  the  respective  values. 

8.  The  numerals  are:  o  (0),  o  (1),  lo  (2),  en  (3),  6  (4),  &  (5),  b  (6),  tr)  (7),  d  (8),  and  ^  (9). 

9.  In  Thai,  words  are  not  written  separately.  In  romanization,  however,  text  is  divided  into  words 

according  to  the  guidelines  provided  in  Word  Division  below. 


Word  Division 

1.  In  general,  Thai  words  formulated  by  romanization  are  made  up  of  a  single  syllable  (ph  kham\  IvitJ 
Thai),  and  thus  each  syllable  is  considered  a  separate  word,  with  a  space  between  each.  (Exceptions 
are  covered  by  rules  2  through  43.)  This  is  in  contrast  to  many  multisyllabic  words  that  are  foreign 
to  Thai  but  that  are  found  written  in  the  Thai  script.  These  multisyllabic  foreign  words  are  written 
with  the  space  closed  up  between  the  syllables  as  appropriate.  Prominent  examples  are  words  of 
Pali  and  Sanskrit  origin,  as  well  as  some  words  that  are  Thai  in  origin  but  have  been  formed 
according  to  the  grammar  of  Pali  or  Sanskrit. 
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Examples  of  monosyllabic  Thai  words: 
ph 
IvitJ 

VJfl 


kham 

Thai 

phut 


Examples  of  multisyllabic  Thai  words  formed  according  to  the  rules  of  other  languages: 
nSSVlITJ  krasuang 

muo  phasa 

\JtUOn  panya 


Examples  of  multisyllabic  words  of  Pali  or  Sanskrit  origin: 


watthanatham 

wvmvitmtj 

mahawitthayalai 

kasat 

warna^ 

phonlamuang 

msto 

phanrong 

2.  In  contrast  to  the  preceding  statement  that  Thai  words  are  monosyllabic,  note  that  there  are 
multisyllabic  Thai  words  in  which  the  first  syllable  is  used  as  a  prefix;  write  the  prefix  and  the 
succeeding  element  as  a  single  word. 


(a)  Words  with  ffl1  (kan)  prefixed 
rmmuvm 
mmiJft 
msamlssMrt 


kandcenthang 

kanplie 

kantangprathet 


(b)  Words  with  PHnvi  ( khwam )  prefixed 

nraww 

piTuirarifj 

prmimu 


khwamching 

khwammai 

khwamhen 


(c)  Miscellaneous  cases  with  various  words  used  as  prefixes.  The  most  common  are: 


via 

Ph? 

mu 

nai 

uu 

mm 

U1n3 

nang 

luk 

m 

khQ 

VJ 

phi 

PH 

kham 

uonj 

ngng 

\m 

bot 

ALA-LC  Romanization  Tables 


page  203 


Thai 


& 

VJ 

phu 

khrirang 

un 

nak 

uuu 

phien 

RU 

khon 

VIQnJ 

V 

thgng 

■0YJ 

chao 

u 

khT 

tan 

dek 

to 

chai 

in 

kha 

chang 

rai 

in 

wai 

siu 

ton 

stn 

duang 

sny 

lai 

vma 

mg 

Note  that  the  listing  of  miscellaneous  cases  covers  only  the  most  common  ones.  Other,  less 

common  cases  may  be  handled  in  the  same  way. 

UCU  bun 

» 

Note  also  that  some  words,  reflecting  a  transposition  in  normal  Thai  word  order  and 
consisting  of  a  principal  word  and  one  or  more  words  used  with  it  as  a  prefix,  may  be 
considered  as  belonging  to  this  category. 

IWUfUTlfltJfliaau  Ratchabandittayasathan 

borannakhadlwitthaya 

IvitJfiatfntri  Thaikhadlsirksa 


3.  Compounds.  Write  compounds  as  a  single  word. 

(a)  General:  Identifying  a  compound  is  largely  a  matter  of  knowledge  of  the  language  together 
with  the  use  of  good  judgment.  Note  that  compounds  generally  result  in  concepts  to  one 
degree  or  another  different  from  the  meaning  of  either  of  the  component  words  when  used 
alone. 


klaichit 

auvn 

khonha 

khemnguat 

thotlgng 

mutton 

phainQk 

\j 

mgngdu 

» 

chatchen 

acwu 

‘  otthon 

fnniT) 

chuakhrao 

klangchieng 

nSivnaj 

o 

klahan 

niucnu 

khropthuan 
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ifStnijaNj 

kieokhgng 

1  y 

raunu 

u 

muban 

rotfai 

Nuifiau 

ngoenduan 

%ivn 

chattham 

«A* 

rapchai 

uanni 

bQkrap 

wa5 

u 

tQSU 

nj  mu 

ropkuan 

tfavntJ 

t  ^ 

sutthai 

wall 

phonlamai 

%  v 
itfam 

sirapha 

RU134 

ton  ‘  eng 

VJAfltJ 

liJmmj 

phutkhui 

paiyiam 

tlphim 

chatphim 

N.B.:  Do  not  consider  as  compounds  repeated  words  sometimes  referred  to  as  "reduplicated 
compounds." 

t 

fTN  1  tang  tang 

tn  *1  cha  cha 

tn  H  reo  reo 


(b)  Reduplicated  Doublets:  When  words  are  strung  together  for  sound,  write  them  as  a  single 
word. 


njStouuJtN 

iluuTau 

amn 

saunau 


pllanplieng 
rTaprgi 
*  u'kkathu'k 
rQpkhgp 


4.  Geographical  Names.  Divide  geographical  names  into  separate  words  according  to  the  decisions 
of  the  U.S.  Board  on  Geographic  Names,  as  expressed  in  the  Gazetteer  of  Thailand. 


5.  Royal  Language  (rachasap).  Write  as  a  separate  word  any  word  found  as  an  entry  in  either  of  the 
following  dictionaries  unless  otherwise  directed  by  these  rules. 

Rachasap  chabap  sombun  /  Rachamanop.  (Bangkok:  Kaona,  1965) 

Rachasap  chabap  sombun  /  Sanguan  ‘Ankhong.  (Bangkok:  Kaona,  1964) 
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6.  Generally  separate  all  elements  in  terms  of  rank,  privilege,  address,  etc., 


persons  unless  another  rule  directs  otherwise. 

inavRnssns'W 

u  t  y 

WjawrrtTNrf 

SQNjQWVlStvi 

wfMnnwi 

viwwvn 

HNlJ«1hin8tj5 

w5:\Jiwuvmjvn 

WKimmmyHrcfari 

fltmwnjSTflrom 

v  ^ 

wstuYJi^iwssiJWjriffuinn 


associated  with  names  of 

Naithahan  Tamruat 
Phuchuai  Sattrachan 
Mgm  Ratchawong 
Rgng  ‘  Amat  Th5 
Phon  Tamruat  ‘Ek 
Phra  Maha 

Rgng  Prathanathibgdi 
Phraparaminthra  Maha 
Phratheprattanaratchasuda 
Sayambgrommaratchakuman 
Phrabat  Somdet  Phrachaoyuhua 
Phranang  Chao  PhrabgrommarachinT  Nat 


One  arbiter  of  whether  official  pronunciations  of  royal  ranks  with  D53J  {Krom)  require  an  extra 
syllable  is  Photchananukrom  nakrJan  chaloem  phrakiat,  Phq.  Sq.  2530 ,  2nd  edition,  1988,  p.  329. 
This  officially  approved  work  is  based  on  proclamations  of  the  Prime  Minister’s  Office  as  well  as 
on  the  Royal  Institute’s  1982  dictionary. 

nSJJVJlstn  Krom  Phraya 

but  Krommaluang 


7.  Generally  combine  elements  (covered  by  terms  described  in  6.  above)  which  begin  with  l^Tl  (£hao) 


<  v  _  _  V' 

and  combine  elements  in  vnuuviEIN  ( Thanphuying ),  l'WJQ3J3J35fn  (i Chaochqmmanda ),  and 

ay  ^ 


1 

VJSSlTiQtjU'l  ( Phrachaoyuhua ) 

Chaonai 

Chaokhgng 

v  y 

iTivh 

Chaofa 

iWltttn 

Chaophraya 

vnuuviaiNS 

Thanphuying 

iwownsfh 

Chaochqmmanda 

u 

Phrachaoyuhua 

but 

o 

Chaofa  Ying 

i<hvJ,wntj 

Chaofa  Chai 

Phra  *  ong  Chao  Chai 
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vmauwiak 

o 

Mgm  thao  Ying 

thao  Mirang  Prie 

Generally  separate  elements  in 

personal  royal  titles  and  corporate  names  beginning  with  D53J 

(. Ki-om )  except  as  listed  below. 

For  these  excepted  cases  only,  although  pronunciation  with  or 

without  the  extra  syllable  are  correct  in  popular  usage,  proclamations  of  the  Prime  Minister’s  Office 
in  recent  years  have  approved  pronunciations  with  the  extra  syllable  (effectively  combining  the 
elements)  as  the  only  officially  correct  pronunciations. 

nSJJW?2 

nwnifrN 

imnimi 

Krommaphra 

Krommaluang 

Krommamu'n 

nwm 

Krommawang 

n»jvn 

Krommatha 

but  majwsstn 

Krom  Phraya 

Keep  separate  the  Buddhist  samanasak  ranks  VJ5SJJVH  (Phra  Maha)  and  (Phra  Khru )  as 

well  as  the  conventional  religious  terms  of  address  W52  (Phra)  and  (Phra  ‘ Achan ). 

9 

Phra  Maha  Prayut 

wssniiik 

u 

Phra  Khru  Lost 

VJStffoJVrfo 

Phra  Somwang 

Phra  ‘Achan  Somwang 

(Phra)  should  be  separate 

as  follows: 

(a)  When  connoting  the  Buddha,  his  image,  a  member  of  the  Buddhist  Order  of  the  Sangha,  etc.; 

(b)  As  the  only  element  or  one  of  the  elements  in  a  separately  written  conferred  rank,  or; 

(c)  As  an  independent  (complimentary,  conventional,  etc.)  element  immediately  preceding  a 

personal  name  (except  the  name  of  Buddha)  but  not  an  integral  part  of  it.  Otherwise,  it 
should  be  written  as  a  combined  prefix  to  words  (usually  associated  with  royalty,  gods,  objects 

or  worship,  etc.). 

wisuniwsnfl 

Phra  Kaso  Mgrakot 

Phra  Sayammathewathirat 

Phra  ‘Achan  Somwang 

wsrasmim 

Phra  ‘AphaimanI 

vjszwvntbsimtj 

t 

Phra  Maha  Prayut 

wisfisiiW 

Phra  Khru  Last 

nfljwszsjviflfl 

Krom  Phra  Mahidon 

ALA-LC  Romanization  Tables 


page  207 


Thai 


vj  %  swvifitjaa  W^vntan 

t 

wisuWiSwiaam 

u 

flua^wsnismiiiu 
but  wsrvMvim^n 

t 

muwsruwia 

nsjjwsrrrtrwin? 

wsrfejnsr 


Phra  PutthayQtfa  Chulalok 
Phra  Nangklao  Chaoyuhua 
Somdet  Phra  Wannarat 
Phraphutthachao 
Phraparaminthra  Maha  Mongkut 
Krommaphra  Mahidon 
Krom  PhraratchawangbgwQn 
Phrasangkharat 


11.  3ivn  (Maha)  should  be  separate  when  as  an  independent  (complimentary,  conventional,  etc.) 
element  it  immediately  precedes  and  is  not  an  integral  part  of  a  personal  name.  Otherwise,  it 
should  be  treated  as  specified  in  paragraph  12.  below. 

‘Wir'lJlUUVl^JVI'UJ^nQ  Phraparaminthra  Maha  Mongkut 

Phraparaminthra  Maha  Phumiphon 
‘  Adunlayadet 

UVnthfl  mahathat 


12. 


VJIsilvn  (Phra  Maha)  is  a  samanasak  rank  conferred  on  a  lower-level  Buddhist  religious  who  has 
finished  at  least  the  third  grade  of  parTan  (Buddhist  theology).  It  should  not  be  confused  with 
UVYV  (maha-)  or  Vt^UVTi-  (phramaha -),  which  are  prefixes  added  to  words  associated  with  royalty, 
religious,  gods,  objects  of  worship,  etc.,  such  as  titles  of  the  supreme  patriarch.  The  rank  should 
be  written  separately;  the  prefix,  combined. 


wsruvnibraviij 

t 

wsriJvntfawiTam 

uvnimr 


Phra  Maha  Prayut 
Phramahasangkharat  Chao 
Phramahasamanachao 
Mahathera 


13.  Titles  of  honor  with  I'Vl  (chao)  as  the  last  element  should  be  written  separate  or  combined  in 
accordance  with  whether  the  immediately  preceding  element  is  commonly  used  as  a  prefix. 

Phra  ‘  ong  Chao 

VJ  5  2^*34 13211^3  Phrasangkharat  Chao 

VJSSVJVlfil'Sn  Phraphutthachao 

VJ?tJJVi3?luaU^n  Phramahasamanachao 


wwim^n 

W5Sl^T)rN^LBOW5SfNm5n 


Phranang  Chao 

Phrachao  \V9rawongth0e  Phra  ‘  ong  Chao 
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14. 


15. 


Combine  {Luang)  when  used 

as  a  simple  (non-conferred)  title 

words  for  relatives  supposed  to  be  Buddhist  priests. 

v^miAia 

4, 

Luangphg 

ttfTNVI 

LuangphI 

vranwatJ 

u 

Luangphgpu 

mi-tm 

t 

Luanglung 

Generally  separate  PiCU  {Khun)  as  first  element,  except  in  the  courtesy 
and  FifUVnfJ  {Khunnai). 

flttUm 
»  , 

Khun  Mie 

fltuwa 

t 

Khun  Phg 

flfUVIU 

t  u 

Khun  Nu 

but  Rcumh 

t  o 

Khunying 

Raima 

t 

Khunnai 

16.  Combine  elements  in  personal  pronouns  or  their  equivalents. 

ITYHIsyJVlfil^n  khaphraphutthachao 

khaphachao 

ViUaW'UU  mgmchan 

nistJU  kraphom 


taifala  ‘  gngtullphrabat 


17.  Elements  in  royal  or  noble  titles  and  names  are  generally  combined  but  should  be  separate  when 
two  distinct  terms  can  be  isolated  in  a  noun/modifier  or  noun/noun-in-apposition  configuration  if 
the  terms  would  not  otherwise  be  combined,  or  if  there  is  doubt  about  whether  to  combine  them. 

Damrongrachanuphap 

‘Ul^ShVlVlQNj  Prasat  Thgng 

Phutlhaygtfa  Chulalok 


v'jvmaavifnumatj 

t 

enniiu 


Phutthaloetla  Naphalai 
Tak  Sin 


18.  Elements  ordinarily  combined  should  be  separate  when  immediately  followed  by  a  proper  name  or 
its  substitute. 

imvlvAvicy  Miethap  Yai 

Rgng  Mie  K§ng  Ngan  Phrathammathut 
l^na'lTlfll  chao  *  awat 

rJ  Chao  Khana  Phak  7 
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inuflfen« 

t 

miWiiJi 

mUJCN 


Nai  Suchat 
Chao  Mirang  Pne 
chao  mirang 


19. 


U1U  (Nai)  before  a  proper  name  as  a  bandasak  or  roughly  equivalent  to  Mister  should  be  separate. 
Otherwise,  it  is  combined  as  a  prefix  in  accordance  with  paragraph  2.(c)  above, 
tnwtarii  Nai  Suchat 


inasatj 

mtmvmtnsTSi 
immvimJlkaij 
u^wnfwVwi  (fa) 


Nairgi 

Naithahan  Tamruat 

Naiphmt  Pratle 

Nai  Phithakracha  (Cho) 


20. 


In  cases  of  doubt  about  whether  miscellaneous  words  listed  under  2.(c)  above  are  functioning  as 
prefixes,  generally  separate  elements  unless  sufficient  conventional  usage  has  bestowed  word  status 
on  the  configuration.  Always  separate  demonstrative  adjectives,  relative  pronouns,  and  numerals 
from  the  first  element. 


nuvlunn 

t 

unomsrtti 

RUUU 

ulm 


but  fluihvn 


khon  phon  khuk 
phukgkanrai 
khon  nan 
phulTphai 
khrirang  nT 
khQ  nirng 
khonsTthao 


21.  Combine  elements  in  colors,  days  of  the  week,  but  not  months  of  the  year. 

sllirang 
Wanchan 

Diran  Phrirtsachikayon 


ffalflfN 

i@auwc|rf^miju 


22. 


Combine  numerals  in  the  same  way  as  in  English;  i.e.,  combine  teens,  combine  first  elements,  and 
separate  from  the  digital  element  if  any  in  twenty  through  ninety-nine,  etc.  Separate  Vl  (thl)  in 
ordinals. 


itaehu 

?auiatn 


sipsam 
yTsip  sgng 
rgi  ‘et 

wan  thT  sipchet 
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23. 


Generally  separate  royal  introductory  verbs  and  verbs  indicating  royal  courtesy,  but  combine 
elements  in  conventional  phrases  of  courtesy. 


vimhrnaiJVjmiJfl 

viMvittwifu 

o 

t 


song  prakgp  phithi  poet 
song  phracharcen 
song  phrakaruna  prot  klao 
khgpchai 


24. 


In  most  cases  separate  ViSnu  (lai),  "dl  ( chua ),  VH 


8134 


(tang),  etc.,  preceding  words. 

Vl'nJssivrt 

vnsTaammm 


irNnm 

t 

vinlfa 

t 

amroen 

vrenupru 

but  ?nnro 

VlYllJ 


VlNjViJJfl 


X 

(thua),  nn  (thuk).  Visa  (thang),  tm  (bang), 

thang  prathet 

thua  ratcha  ‘  anachak 

chua  chTwit 

bangkhrao 

thuk  sing 

tang  manda 

lai  khon 

chuakhrao 

thuapai 

thangmot 

chuamong 

tangprathet 


25. 


Generally  separate  43U  (ngan)  preceding  a  word  or  phrase  except  in  cases  denoting  literary  or 
research  activity  when  only  two  elements  are  present. 

43UVI?^1i3'2fvnuiVMfl4fl'‘lAl  ngan  phraratchathan  ploeng  sop 

ngan  thurakit 

TlUlBtJU  ngankhian 


nganwichai 


26.  Combine  or  separate  the  following  miscellaneous  phrases  as  shown. 


rita'jlu 

nirang  nai 

imlnu 

kha3  nai 

'inentj 

waduai 

fhwm 

luangna 

n^inu 

ruam  kap 

V' 

tuaNjei  u 

birangton 

pisuiau 

khrop  r§p 

fialil 

V'  | 

tQpai 

vjiawihtj 

phrgm  duai 

{14118 

1 

tangtas 

piutaNj 

ton  ‘  eng 

^unisviNj 

chonkrathang 

Aa^'in 

nirangchak 

%tn4 

chon  thung 
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27.  Combine  91Z7U  ( tawan )  not 
flsnuaonAtNmua 
sinuflniiWiin 
anuanmiNMua 
anuoanmoNAa 


(chiang)  in  compass  points. 

tawan ‘Qk  chiang  nira 
tawantok  chiang  tai 
tawantok  chiang  nira 
tawan ‘Qk  chiang  tai 


28. 


Generally  separate  geographical  designations  with 

ma^mua 

iSa^a 

but  madman 


OKM  ( mxtang ). 
mirang  nira 
mirang  tai 
mirangnQk 


29.  When  fiTUI  (khwam)  is  used  in  the  legal  context  meaning  "case"  or  "matter",  it  should  be  separate. 
RT&IUVM  khwam  phmng 

RT)34a7cy7  khwam  ‘  aya 


30.  Those  elements  equivalent  to  English  hyphenated  adjectival  phrases  (not  clauses)  should  be 
combined  although  kept  separate  from  any  noun  (not  included  under  paragraph  2.(c))  that  they 
might  modify,  the  latter  clause  applying  only  to  phrases  that  are  clearly  adjectival  in  nature  and  not 
including  the  phrase  words  whose  role  Thai  syntax  makes  doubtful. 


vuwftoUnffom 

tsasnuuhma 

\Ajaisn?lN30on 

miiwnmifloviamhtj 


nangsir  poksikhao 
rira  chaifangthale 
khaopradapdin 
sinkhasong*  Qk 
phQkhasong*  Qk 
kanchenracha  kankha  laifai 


31. 


Many  phrases  consisting  of  predicate  and  object  have  attained  single  word  status.  But  if  there  is 
doubt  that  the  elements  have  attained  single  word  status,  do  not  combine  them  unless  30.  above 
applies. 


vrmu 

'miuu 

a^owi 

v  * 

but  mstnuriu 

y  I 

mitnofiujpu 

fiulei 


pramcenphon 
thamngan 
wangphmn 
‘  uaiphQn 
kanyai  thin 
kanyai  thinthan 
thin  tai 
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32.  Generally  separate  phrases  with  fhu  ( suan ). 
flhuwttow 
fftuvi&iucN 
but  shuen 

fhunu 


suan  phra‘ong 
suan  phonlamirang 
suantua 
suanruam 


33.  Phrases  with  words  indicating  buildings  as  first  element  are  generally  combined. 


hQsinlapa 

mnJsswj 

hgngprachum 

^  t 

tl'SWUW 

rongphim 

mttm 

hgsamut 

1*31  SOU 

rongrlan 

34.  For  terms  not  of  rank,  privilege,  address,  etc.,  generally  do  not  combine  elements  that  might 
otherwise  be  candidates  for  combining  when  the  second  element  is  a  proper  name  unless  the  first 
element  appears  in  the  list  under  2.(c)  above.  Nevertheless,  capitalize  the  first  element  in  any  case. 
‘HOriJJWfl  Hq  Phumiphon 

StutlUfl  Rot  Ben 

flJJfiSVvIlsnvi  KhamphT  Phrawet 

but  RUlviiJ  Khonthai 


35.  For  phrases  consisting  of  a  verb  and  4'VU  ( ngan ),  generally  do  not  combine. 
ltofhU>OTU  prasan  ngan 

TAITtU  ruam  ngan 

\j£jU61^U  patibat  ngan 

o 

but  VHTIU  thamngan 


36.  1Q3-J  (ruam)  as  initial  or  final  element  should  generally  be  separate. 

» 

ruam  lem 

•  V  _  _ 

nsJOUentJ  ruam  yu  dual 

,  u  _ 

but  SI7U513J  suanruam 


37.  Keep  separate  elements  that  purport  to  be  translated  phrases  from  other  languages. 

✓ 

IMS' NakhQn  Wat 


titnuflfums 


Sayam  Sakhwie 


ALA-LC  Romanization  Tables 


page  213 


Thai 


38. 


39. 


40. 


41. 


42. 


Generally,  when  applying  word  division  rules,  retain  word  division  appearing  in  the  Thai  script  only 

if  there  is  doubt  about  the  exact  mean  of  the  configuration.  Otherwise,  ignore  the  division  in  Thai 

script. 

vmo-lwj  iJlvissrtu 
? 

Khrongkan  Phut-Thai  Parithat 

» 

Phrases  beginning  with  U-'i  (««)  should  only  be  combined  if  convention  has  conferred  word  status 

on  the  configuration. 

iruSwnti 

nasTadai 

influx 

nasonchai 

but  rn^utn 

na  chap  ta 

U15  U5Qn3 

na  raprgng 

Separate  words  beginning  with  WHO?  ( thahan )  for  military,  etc. 

vivmun 

thahan  bok 

vnmso 

thahan  rira 

vnn«nmrf 

thahan  ‘akat 

o 

vnrtsfnrw 

thahan  tamruat 

Combine  the  generic  element  with  all  elements  of  a  name  for  plants,  animals,  vegetables,  etc. 

ummvia'3 

f 

nokkhunth$ng 

ufNrm 

tiengkwa 

t  ^ 

sunakchingchhgk 

wnrmvn 

pakkathua 

srnmnn 

tonmak 

For  diseases,  combine  the  generic  term  tSfl 

( rok )  only  if  it  appears  as  the  final  element  of  the 

configuration. 

kanlarok 

miAw 

kammarok 

I’Spiijri.fsa 

rok  mareng 

lintavifluatj 

rok  lohit  ngi 

hmmw 

r5k  thgraphit 

aymeinba 

‘ahiwatakarSk 

IwtJerw 

rok  fldat 
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43. 


Combine  adjectival  phrases  with  1^1  (chat)  as  final  element. 


iouta 

fllwnWta 


rQnchai 

krengchai 

sukkhaphapchai 

dichai 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

A 

digraph  AE  (upper  case) 

A5 

CE 

digraph  OE  (upper  case) 

A6 

ir 

U-hook  (upper  case) 

AC  " 

« 

ayn 

BO 

a3 

digraph  ae  (lower  case) 

B5 

ce 

digraph  oe  (lower  case) 

B6 

vc 

u-hook  (lower  case) 

BD 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

r-i 

LJ 

macron 

E5 

X, 

LJ 

hachek 

E9 

g 

right  cedilla 

F8 
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Tibetan 

Internal  capitalization  of  base  consonants  is  not  to  be  followed.  When  Tibetan  is  written  in  another 
script  (e.g.,  ’Phags-pa)  the  corresponding  letters  in  that  script  are  romanized  according  to  this  table. 

Vowels  and  Diphthongs  (see  Notes  1  and  2) 


w 

[T 

•s. 

w 

is* 

w 

IT 


a 

a] 

i 

r] 

u 

U] 


V 


S’ 

o 

^  ’ 
1ST 


[ 

bt 

[sr 


f] 

n 

ii 

e 

ai] 

o 

au] 


Consonants  (see  Notes  2  and  3) 


Gutturals 

Palatals 

Cerebrals 

Dentals 

V 

ka 

z  ca 

r 

ta] 

r 

ta 

i**’ 

kha 

5'  cha 

[*• 

tha] 

y 

tha 

T 

ga 

E’  ja 

da] 

V 

da 

[»' 

gha] 

[%'  jha] 

[*• 

dha] 

[S’ 

dha] 

E’ 

na 

y  na 

[  r 

na] 

V 

na 

Labials 

Affricates 

Semivowels 

Sibilants 

*)• 

pa 

t' 

tsa 

wa 

<y 

za 

*)  • 

pha 

tsha 

<V 

’a 

(see  Note  4) 

3 • 

za 

q* 

ba 

Z’ 

dza 

or 

ya  (see  Note  5) 

y 

sa 

[r 

bha] 

[r 

dzha] 

v 

ra 

* i • 

sa 

ma 

la 

[p’ 

sa] 

Aspirate 


V 


ha 


Anusvara 


m 


Anunasika 

[fi  m] 


Visarga 
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Notes 

1.  Only  the  vowel  forms  that  appear  at  the  beginning  of  a  syllable  are  listed.  A  syllable  is 
defined  as  a  graph  or  group  of  graphs  followed  by  a  tsheg  (□).  The  forms  used  for  vowels 
following  a  consonant  can  be  found  in  grammars. 

2.  Letters  enclosed  within  brackets  in  the  table  are  used  in  Sanskrit  words. 

3.  The  vowel  a  is  implicit  after  all  consonants  in  independent  form  and  is  supplied  in 
romanization,  unless  another  vowel  is  indicated  by  its  appropriate  sign.  Tsheg  (□),  the 
syllabic  boundary  marker,  is  represented  by  a  hyphen  in  proper  names  and  by  a  space  in  other 
words. 

4.  The  ’a  chuh  is  represented  by  an  apostrophe  (  ’  )  in  pre-initial  and  final  position  unless 
marked  with  a  vowel  marker,  in  which  case  it  is  represented  by  the  apostrophe  plus  the 
appropriate  vowel.  When  ’a  chuh  is  written  below  any  letter  representing  vocalic  length,  it 
is  romanized  according  to  the  vowel  table. 

5.  When  «i'  is  preceded  by  or ,  it  is  romanized  g'y  to  distinguish  it  from  3’  which  is  romanized 
gya. 

SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

) 

apostrophe 

27 

/ 

soft  sign  (prime) 

A7 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

tilde 

E4 

□ 

macron 

E5 

□ 

dot  above 

E7 

fi 

candrabindu 

EF 

□ 

dot  below 

F2 

g 

circle  below 

F4 

□ 

underscore 

F6 
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Tigrinya 


Syllables 


1st  Order 

2nd 

Order 

3rd  Order 

4th  Order 

5th 

Order 

6th  Order 

7th  Order 

0 

ha 

U* 

hu 

Y. 

hi 

y 

ha 

t 

he 

I! 

he  or  h 

ir 

ho 

A 

la 

A* 

lu 

A. 

li 

A 

la 

tu 

le 

A 

le  or  1 

A* 

lo 

ih 

ha 

ih* 

hu 

<h. 

hi 

* 

ha 

«ft» 

he 

/h 

he  or  h 

lb 

ho 

CO 

ma 

fltH 

mu 

mi 

ma 

me 

y 

me  or  m 

7° 

mo 

u> 

sa 

»P* 

su 

✓  • 
si 

•u 

sa 

•** 

se 

/* 

se  or  s 

V* 

so 

l 

ra 

4* 

ru 

6 

ri 

in 

ra 

4 

re 

c 

re  or  r 

e 

ro 

A 

sa 

A* 

su 

tu 

si 

A 

sa 

A. 

se 

A 

se  or  s 

A 

so 

n 

V 

sa 

ft 

V 

su 

ft. 

v  • 

SI 

n 

v  — 

sa 

ft 

V  / 

se 

•fi 

se  or  s 

¥ 

so 

+ 

qa 

* 

qu 

4* 

qi 

qa 

4? 

qe 

* 

qe  or  q 

* 

qo 

qa 

qu 

5 

qi 

? 

qa 

qe 

* 

qe  or  q 

¥ 

qo 

n 

ba 

ft 

bu 

fl 

bi 

n 

ba 

ft. 

be 

•ft 

be  or  b 

n 

bo 

+ 

ta 

* 

tu 

•t 

ti 

;>• 

ta 

* 

te 

te  or  t 

to 

¥ 

V 

ca 

cu 

•*£ 

V* 

Cl 

? 

v — 

ca 

¥ 

V  / 

ce 

¥ 

ce  or  c 

¥ 

V 

CO 

ft 

ha 

•V 

hu 

hi 

ha 

ft. 

he 

ft 

he  or  h 

•9 

40 

ho 

V 

na 

V* 

nu 

i 

ni 

<? 

na 

1 

ne 

ft 

ne  or  n 

no 

7 

na 

? 

nu 

Tt 

ni 

na 

7 

ne 

ne  or  n 

? 

no 

A 

’a 

it. 

’u 

A- 

’i 

A 

’a 

ft 

’e 

A 

’e 

’o 

h 

ka 

ft 

ku 

ft 

ki 

h 

ka 

lb 

ke 

to 

ke  or  k 

to 

ko 

xa 

Tt* 

XU 

*ft 

xi 

*fi 

xa 

•ft 

xe 

•fi 

xe  or  x 

•fi 

xo 

0 

wa 

to. 

wu 

<2 

wi 

9 

wa 

we 

o* 

we  or  w 

ft 

wo 

0 

‘a 

ft 

‘u 

ft 

‘i 

ft 

‘a 

ft 

‘e 

A 

‘e 

r 

‘o 

H 

za 

ft 

zu 

ft 

zi 

H 

za 

H. 

ze 

ft! 

ze  or  z 

H 

zo 

7f 

V 

za 

V 

V 

zu 

ftC 

v  • 

Zl 

IT 

v  — 

za 

Tf 

V  x 

ze 

3f 

V  V 

ze  or  z 

ftT 

V 

zo 

% 

V 

za 

PS 

V 

zu 

ftC 

V  • 

Zl 

PC 

v  — 

za 

pc 

V  / 

ze 

ze  or  z 

& 

zo 

> 

- 

- 

- 

- 

Tf 

ze  or  z 

- 

r 

ya 

yu 

ft 

yi 

p 

ya 

CL 

ye 

6 

ye  or  y 

p* 

yo 

£ 

da 

V 

* 

du 

V 

ft 

di 

• 

* 

da 

v  — 

r® 

ft 

de 

ft 

de  or  d 

V  V 

* 

do 

V 

£ 

ga 

* 

gu 

ft 

gi 

* 

ga 

£ 

ge 

£ 

ge  or  g 

go 

7 

ga 

ft 

gu 

ft 

gi 

ga 

% 

ge 

1 

ge  or  g 

ft 

go 

m 

ta 

m* 

tu 

• 

ti 

**1 

ta 

au 

te 

V 

te  or  t 

(n 

to 

A 

ca 

at 

cu 

ai. 

ci 

ca 

atr 

ce 

-P 

ce  or  c 

Q* 

CO 

A 

Ea 

A. 

Pa 

ft 

Ri 

A 

Pa 

ft 

Re 

ft 

pe  or  p 

A 

po 

X 

sa 

su 

ft 

si 

A 

sa 

ft 

se 

ft 

se  or  s 

fi 

so 

0 

sa 

ft 

su 

1 

si 

? 

sa 

% 

se 

A 

se  or  s 

so 

* 

fa 

*• 

fu 

it 

fi 

4- 

fa 

ft 

fe 

S 

fe  or  f 

£ 

fo 

T 

Pa 

T 

pu 

T 

Pi 

Pa 

T 

pe 

T 

pe  or  p 

r 

po 

7J 

va 

ft 

vu 

ft 

vi 

ft 

va 

ft 

ve 

ve  or  v 

7? 

VO 
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Combinations  with  w : 


1st  Order  2nd  Order  3rd 


qwa 

£■ 

$ 

qwa 

<h 

•V* 

hwa 

•X 

b- 

kwa 

X. 

xwa 

Th. 

> 

gwa 

X 

Order 

4th  Order 

5th  Order 

qwi 

qwa 

* 

qwe 

qwi 

J  qwa 

* 

qwe 

hwi 

^  hwa 

* 

hwe 

kwi 

\\  kwa 

kwe 

xwi 

xwa 

'V 

xwe 

gwi 

^  gwa 

> 

gwe 

6th  Order  7th  Order 

qwe 

qwe 

•y*  hwe 

kwe 
xwe 
T>  gwe 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 

Special  character 

Name 

USMARC  hexadecimal  code 

alif 

AE 

< 

ayn 

BO 

Character  modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

tilde 

E4 

□ 

macron 

E5 

□ 

dot  above 

E7 

hachek 

E9 

□ 

dot  below 

F2 

□ 

underscore 

F6 
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Uighur 


Letters  of  the  Alphabet 


Initial 

Medial 

Final 

Alone 

Romanization 

L 

L 

L 

1  4  Is. 

a 

- 

b 

mJ 

jf 

V 

P 

J 

CJ 

t 

P* 

e- 

c 

j 

S* L. 

r 

5 

ch 

6- 

C 

kh 

x 

d 

J 

j- 

V 

r 

J 

> 

j- 

J 

z 

3 

3r 

> 

1) 

zh 

m+Jj 

c>~- 

o- 

s 

mJjJ 

-cix. 

i_p- 

sh 

S 

A 

e- 

l 

gh 

3 

A. 

«_a 

f 

£ 

i. 

0» 

q 

viL 

vJ 

k 

uL 

«& 

g 

2 

s. 

viL 

ViJ 

ng 

J 

i 

J- 

J 

1 

JO 

A. 

f- 

? 

m 

mJ 

-L 

v>- 

o 

n 

3 

i- 

> 

J 

V 

J2> 

■e 

<X  « 

o  <  & 

h 

mJ 

*3 

y 
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Vowels 


Initial 

Medial 

Final 

Alone 

Romanization 

b. 

L 

L 

b. 

a 

<50 

a  (  <x 

a  t  <\ 

<50 

a 

i  a. 

- 

*3 

«3  ‘  dr 

i 

-i  i  -H 

f 

^  ‘  ^ 

e 

*• 

3- 

y- 

3* 

3  *  3s 

o 

«• 

5- 

V 

3- 

V  Vf 

3  ‘  3-1 

6 

*  «. 

» 

^  *  * 

3-1 

3- 

3- 

3  ‘  3“* 

u 

1  * 
y* 

'y- 

'> 

1  1  * 

3  ‘  3-< 

ii 

RULES  OF  APPLICATION 


Letters  Which  May  Be  Romanized  in  Different  Ways  Depending  on  Their  Context 
1.  Romanize  initial  followed  by  a  vowel  as  y. 


Yakhshi 

yoldash 

yeqin 

yigana 


i  ml 


Romanized  medial  ^  followed  by  a  vowel  as  i. 


airoport 

oiman 


Romanize  medial  preceded  and  followed  by  a  vowel  as  y. 


ayat 

ayyam 

oyush 


gAjIi 
*  * 


Romanized  medial  preceded  and  followed  by  ,3  as  y. 


qiyin 

jiyim 

yiyim 


pUL^-iS^ 
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5.  Romanize  medial  ^5  preceded  by  a  consonant  and  followed  by  a  vowel  as  y. 


aghyar 

ashya 


I  .1 


6.  Romanize  final  ,3  and  as  i. 

muqami 

tili 

ai 

qarghai 

tarikhi 

7.  Romanize  ^  when  followed  by  ^  as  n. 

bilishing 

bilishinga 

jang 

janga 

shojang 

shojanga 


*3^3-° 


t «l  i i\,  iL. 

A aU  A  1  i J 


Vills» 

<^U 


*  ^  ^  * 1 


8.  Do  not  romanize  the  jazm  (  ->  ). 

turk 

qirq 

ministr 

9a.  Romanize  the  diphthong  as  ai. 

airoport 

bailiq 

9b.  Romanize  the  diphthong  as  ai. 

baitulla 

paishanba 

daifu 


<3^ 


^3*3^ 11 

^  i  1  i  iL 


^-9u  Oi 


10.  Use  the  single  prime  )  to  separate  two  letters  representing  two  distinct  consonantal  sounds  when 
the  combination  might  be  read  as  a  digraph. 


uz'hal 


Jlo 


,J3J 
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11.  Romanize  foreign  words  that  occur  in  an  Uighur  context  and  are  written  in  uighur  letters  according 
to  these  rules  for  romanizing  Uighur. 

Vena  ( not  Vienna)  Lnji 

jughrapi  ( not  geography)  ^1  Jz  y* 

sitain  ( not  stein) 

12.  Follow  the  rules  for  the  capitalization  of  English. 

SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

/ 

soft  sign  (prime) 

A7 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

□ 

breve 

E6 

□ 

umlaut  (dieresis) 

E8 
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Ukrainian 


Vernacular  Romanization  Vernacular  Romanization 


Upper  case  letters 

A 

B 

B 

r 

r 

a 

E 

e 

>K 

3 
M 
I 

1 

H 

K 

n 

M 

H 

o 

n 

p 

c 

T 

y 

$ 

x 

il 

4 
HI 

m 

b 

K) 

X 


A 

B 

V 
H 
G 
D 
E 

ffi 

ZTi  (see  Note  1) 
Z 

Y 

I 

•• 

1 

I 

K 

L 

M 

N 

O 

P 

R 

S 

T 

U 

F 

Kh 

(see  Note  2) 
Ch 
Sh 
Shch 

1  (soft  sign) 

TD 

!A 


Lower  case  letters 

a 

6 

B 

r 

r 

a 

e 

e 

x 

3 

M 

i 

I 

K 

n 

M 

H 

O 

n 

P 

c 

T 

y 

<t> 

X 

U 

H 

in 

LU 

b 

K) 


a 

b 

V 

h 

g 

d 

e 

ie 

zB  (see  Note  1) 
z 

y 

i 

I 

v 

l 

k 

1 

m 

n 

o 

P 

r 

s 

t 

u 

f 

kh 

ts  (see  Note  2) 

ch 

sh 

shch 

1  (soft  sign) 

1U 

o 

ia 
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Notes 

1.  The  ligature  is  necessary  to  distinguish  x  from  the  combination  3r. 

2.  The  ligature  is  necessary  to  distinguish  u  from  the  combination  tc. 

SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

/ 

soft  sign  (prime) 

A7 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

□ 

breve 

E6 

□ 

umlaut  (dieresis) 

E8 

6 

ligature,  1st  half 

EB 

0 

ligature,  2nd  half 

EC 
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Urdu 

(in  Arabic  Script) 

Letters  of  the  Alphabet 

Initial 

Medial 

Final 

Alone 

Romanization 

1 

L 

L 

1 

omit  (see  No 

J 

ex 

b 

mJ 

v 

LJ 

P 

J 

a. 

Cx. 

di 

t 

u 

j® 

J® 

4® 

t 

mJ 

"*• 

LJ 

5 

ci^ 

cl 

t 

- 

- 

— 

t 

J 

Clx. 

CL 

j 

l 

L*. 

di 

s 

2? 

i*v. 

£- 

C 

j 

5V. 

r 

S 

c 

5V. 

c- 

c 

h 

5V. 

6 

c 

kh 

X 

d 

% 

4- 

X 

L 

4- 

d 

i 

L 

L 

d 

i 

L 

L 

s 

d 

i 

i. 

L 

z 

V 

V 

V 

j 

r 

J* 

4* 

i> 

4. 

V 

jr 

V 

V 

r 

J 

Jr 

Jr 

J) 

r 

J 

Jr 

Jr 

J 

r 

J 

Jr 

Jr 

J 

z 

1) 

Jr 

Jr 

Jf 

zh 

m+Jj 

•riUA. 

s 

JjJ 

■xo». 

U^- 

sh 

-*3 

U^- 

U'5 

s 

-^3 

U^ 

v>= 

z 

L 

Lx. 

JL. 

L 

t 

L 

Lx. 

Lx. 

L 

z 

X 

A 

e- 

6 

‘  (ayn) 

X 

A 

e- 

6 

lb 

J 

<_a 

f 

JS 

JL 

<3- 

0* 

q 

iL 

vd 

k 

3 

£ 

g 

J 

L 

J- 

J 

l 

JO 

JO. 

p- 

P 

m 
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-J 

-k 

O- 

u 

n 

-d 

- 

LS- 

u 

n 

(see  Note  2) 

3 

3~ 

3- 

3 

V 

it) 

4 

o 

h 

■  (  i 

- 

- 

6 

t 

(see  Rule  10) 

_ i 

+  (o-  ‘  vr) 

(c. 

‘  s?)  eS 

y 

(see  Note  3) 

Digraphs  Representing  Urdu  Aspirates  (see  Note  4)  Value 


bh 

ph 

th 

th 

jh 

ch 

dh 

dh 

rh 

kh 

gh 


4-t 


4* 

4^ 


-6- 

4* 

4* 


.Di 


4» 


4^ 


Urdu  Vowels  and  Diphthongs  (see  Note  5) 


Value 


a 

u 

i 

a 

a 

a 

u 

T 

o 

e 

au 

ai 


□ 

* 

o 

□ 

6 

Isn  *  sj,n 

3U 

vs9 

3° 

CU  « 

O  .** 

J'J 

cn  <  Isn 


Notes  to  the  Tables 

1.  For  the  use  of  I  ( alif)  to  support  *  ( hamzah )  and  □  (maddah)  see  rules  1  and  2,  respectively. 

For  the  romanization  of  *  by  ’  ( alif)  see  rule  12.  For  other  orthographic  uses  of  I  see  rules 

3-4. 

2.  For  the  distinction  between  O  and  u  see  rule  6. 

3.  For  the  distinction  between  ^3  and  ^  see  rule  11(c)  and  (e). 

4.  For  the  form  of  the  letter  in  these  digraphs,  see  rule  9. 

5.  Vowel  points  are  used  sparingly,  and  for  romanization  must  be  supplied  from  a  dictionary. 
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RULES  OF  APPLICATION 

Letters  Which  May  Be  Romanized  in  Different  Ways  Depending  on  Their  Context 

1.  I  (alif),  3  and  v5  are  used  to  support  *  ( hamzah );  see  rule  12.  When  so  used,  these  letters  are  not 
represented  in  romanization. 

2.  I  ( alif)  is  used  to  support  □  (j maddah );  see  rule  13.  When  so  used,  it  is  not  represented  in 
romanization. 

3.  I  (alif)  is  used  after  a  consonant  to  indicate  the  long  vowel  romanized  a. 

raj 

karna 

In  some  words  of  Arabic  origin  this  alif  appears  as  a  superscript  letter  over  representing  the  alif 
maq  surah. 

da‘va 

The  alif  is  sometimes  omitted  in  writing.  It  is  always  represented  in  romanization. 

‘Abdurrahman  cX*** jJI  ^  ^ 

When  the  long  vowel  a  is  initial,  it  is  written  T.  See  rule  13(a). 

4.  I  (alif)  may  be  used  as  an  orthographic  sign  without  phonetic  significance.  In  these  cases  it  is  not 
represented  in  romanization.  See  rule  16. 

‘  ilman  L&lc 

5.  appears  as  a  superscript  letter  over  and  v  when  the  latter  represent  the  cerebral  sounds 
romanized  t,  d,  and  r,  respectively. 

6.  Regardless  of  pronuncitation,  undotted  forms  of  the  letter  O  are  romanized  n  and  dotted  forms  are 
romanized  n. 

jahan 

7.  3  is  used: 

(a)  To  represent  the  consonant  sound  romanized  v. 

dev  3-*> 

vujud  ^3*3 

In  some  words  of  Persian  origin  this  consonant,  though  written,  has  ceased  to  be  pronounced. 
It  is  retained  in  romanization. 

khvish 
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(b)  To  represent  the  long  vowel  romanized  u. 

ukh 
Urdu 

(c)  To  represent  the  long  vowel  romanized  o. 

os 
dost 

For  the  romanization  of  the  conjunction  j  as  o  see  rule  19. 

(d)  To  represent  the  diphthong  romanized  au. 

aur 

qaumf 

(e)  To  support  *  ( hamzah ).  See  rule  12. 

u> 

For  the  use  of  □  ( shaddah )  with  j  see  rule  14. 

8.  °  is  used  to  represent  the  consonantal  sound  romanized  h. 

ham 

gah  olS 

Final  °  ,  though  often  not  pronounced,  is  normally  retained  in  romanization. 

kih  ^ 

guldastah 

Exception  is  made  in  the  case  of  words  whose  final  syllable  ends  in  an  aspirated  consonant.  When 
final  &  is  added  to  the  letter  ■©  in  this  position,  it  is  not  represented  in  romanization. 

mukh 


C-91 


'J* 


9.  °  (usually  written  in  the  form  -&)  is  used  to  represent  the  aspirated  element  of  the  sounds 

romanized  bh,  ph,  th,  th,  jh,  ch,  dh,  dh,  rh,  kh,  gh. 

phul  J 

accha 


For  the  writing  and  romanization  of  words  ending  in  an  aspirated  consonant,  see  rule  8. 

10.  &  and  which  are  sometimes  used  interchangeably,  are  both  romanized  t. 

hikmat 


11.  v3  (often  written  «^s)  is  used: 

(a)  To  represent  the  consonant  romanized  y. 

siyasat 

daya 


ALA-LC  Romanization  Tables 


page  229 


Urdu 


(b)  To  represent  the  long  vowel  romanized  L 

tasvir 

Ishvar 

(c)  To  represent  the  long  vowel  romanized  e. 

sher 

neva 


When  with  this  value  is  final,  the  form 
se 

la  rke 


generally  replaces  «3- 


cr^ 


(d)  To  represent  the  long  vowel  romanized  d.  See  rule  3. 

da‘va  3  3^ 

‘uqba 


(e)  To  represent  the  diphthong  romanized  ai. 

maidan 

bail 


When  3  with  this  value  is  final,  it  is  sometimes  written  0  . 

hai  cr® 

(f)  To  support  <=  ( hamzah ).  In  this  position  ,3  is  usually  undotted.  See  rule  12. 

u> 

For  the  use  of  □  (shaddah)  with  <3  see  rule  14. 

For  the  use  of  >3  in  a  muzaf  see  rule  17. 


Romanization  of  Orthographic  Symbols  Other  Than  Letters  and  Vowel  Signs 


Although  vowel  signs  are  frequently  omitted  in  printed  texts,  they  are  always  taken  into 
consideration  in  romanization.  The  rules  for  other  symbols  vary. 

12.  *  ( hamzah ) 


(a)  In  initial  position  *  is  not  represented  in  romanization. 


(b)  In  medial  and  final  position,  when  *  represents  a  consonant,  it  is  romanized  ’  ( alif ). 

mu  *  min  0* 3* 

li’e  cii 

bha’f 
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(c) 


When  *  represents  the  connective  syllable  joining  a  muzaf  to  what  follows,  it  is  romanized  -yi. 
See  rule  17. 

£ 

malikah-yi  Inglistan  oh-iKjl  <^-L° 


13.  □  ( maddah ) 

(a)  At  the  beginning  of  a  word,  or  following  the  Arabic  article  Jl ,  T  is  romanized  a. 

ab 

(b)  At  the  beginning  of  a  syllable  within  a  word,  T  is  romanized  ’5. 

mir’at 

Qur  ’an  erf. 

(c)  □  is  otherwise  omitted  in  romanization. 


14.  □  ( shaddah  or  tashdud )  indicates  the  doubling  in  pronunciation  of  the  letter  over  which  it  is 
written.  It  is  represented  in  romanization  by  doubling  the  letter  or  digraph  concerned. 

caeca 
kha  t  ta 
makkhf 

When  □  occurs  over  3  and  ,3,  these  letters  are  regarded  as  representing  consonants.  They  are 
romanized  vv  and  yy,  respectively. 

quwat 
sayyid 
Zakariyya 

o  _ 

15.  □  ( sukun  or  jazm )  indicates  the  absence  of  a  vowel  following  the  letter  over  which  it  is  written. 
It  is  not  represented  in  romanization. 

16.  Tanvin  (written  □,  Q ,  0  (*□))  is  romanized  un,  in,  an,  respectively,  when  it  occurs  in  a  word  or 
expression  borrowed  from  Arabic.  Otherwise  it  is  not  represented  in  romanization. 

fauran 


Romanization  as  Affected  by  Grammatical  Structure 

17.  Izafat. 

(a)  When  the  muzaf,  the  first  of  two  words  in  the  grammatical  relationship  known  as  izafah,  ends 
in  a  consonant,  -i  is  added  to  it  in  the  romanization. 
tarlkh-i  Hindustan 
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(b)  When  the  muzaf  ends  in  a  vowel  or  in  silent  -yi  is  added. 

darya-yi  shor  *3^^ 

zaban-i  Urdu-yi  mu‘alla 

malikah-yi  Inglistan  ^  «^^-Lq 

For  the  use  of  *  (hamzah )  to  indicate  the  muzaf,  see  rule  12(c). 

18.  The  Arabic  article  Jl  is  romanized  differently  depending  on  the  letters  and  context  with  which  it 
is  associated. 


(a) 


When  it  is  prefixed  to  a  word  beginning  with  a  "moon  letter"  (  £  <  £  < 

^  )  it  is  romanized  al. 
al-Qur’an 


«3  ‘  °  ‘  3  ‘  p 


(b)  When  it  is  prefbced  to  a  word  beginning  with  a  "sun  letter" 

Jo  )  the  l  of  the  article  is  replaced  in  romanization  by  the  same  letter, 

or  digraph,  as  that  which  begins  the  following  word, 
as-sijill 


(c)  When  it  occurs  before  the  second  element  in  a  name,  the  vowel  of  the  article  is  replaced  by 
the  final  vowel  of  the  preceding  word. 

‘Abdul‘azfz 

‘Abdurrashfd 

Abulfazl 

Zulqarnain 

Fazlullah  Ah  I 


19. 


The  conjunction  j,  when  used  to  join  two  closely  associated  members  of  a  phrase,  is  romanized 

o. 

mal  o  asbab  j  JLo 


Otherwise  j  is  romanized  va. 


20.  Rules  for  the  capitalization  of  English  are  followed,  except  that  the  Arabic  article  al  is  lowercased 
in  all  positions. 

21.  The  macron  is  used  with  both  capital  and  lowercase  letters. 

22.  The  hyphen  is  used: 

(a)  To  connect  a  muzaf  with  the  following  vowel  or  syllable.  See  rule  17. 

(b)  To  connect  the  Arabic  article  al  with  the  following  word.  See  rule  18. 
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23.  Foreign  words  in  an  Urdu  context,  including  Arabic  and  Persian  words,  are  romanized  according 
to  the  rules  for  Urdu. 

c- Uo  Lo  ^  jQ>  »_( Lr 

Janab-i  Hed  Mas  tar  sahib-i  Gavarnmant  Ha ’I  Iskul 

For  short  vowels  not  indicated  in  the  script,  the  Urdu  vowels  nearest  the  original  pronunciation 
of  the  word  concerned  are  supplied  in  romanization. 

24.  A  quotation  in  another  language  using  the  Arabic  script  is  romanized  according  to  the  rules  for  the 
language  concerned. 


SPECIAL  CHARACTERS  AND  CHARACTER  MODIFIERS  IN  ROMANIZATION 


Special  Characters 

Name 

USMARC  hexadecimal  code 

5 

alif 

AE 

( 

ayn 

BO 

Character  Modifiers 

Name 

USMARC  hexadecimal  code 

6 

acute 

E2 

□ 

macron 

E5 

□ 

dot  below 

F2 

0 

double  dot  below 

F3 

□ 

underscore 

F6 
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Character  Sets  Used  in  Romanization 


The  transliterations  produced  by  applying  ALA-LC  Romanization  Tables  are  encoded  in  machine- 
readable  form  into  USMARC  records.  Encoding  of  the  basic  Latin  alphabet,  special  characters,  and 
character  modifiers  listed  in  this  publication  is  done  in  USMARC  records  following  two  American 
National  Standards;  the  Code  for  Information  Interchange  (ASCII)  (ANSI  X3.4),  and  the  Extended  Latin 
Alphabet  Coded  Character  Set  for  Bibliographic  Use  (ANSEL)  (ANSI  Z39.47).  Each  character  is 
assigned  a  unique  hexadecimal  (base-16)  code  which  identifies  it  unambiguously  for  computer  processing. 

The  graphics  for  certain  characters  used  in  romanization  are  so  similar  that  they  are  often  confused. 
The  apostrophe  (’),  soft  sign  (prime)  ('),  acute  accent  O,  alif  (’),  and  ayn  (‘)  are  typical  of  characters 
that  are  difficult  to  differentiate.  A  listing  has  been  provided  at  the  end  of  each  romanization  table  that 
shows  special  characters  and  character  modifiers  that  are  used  with  it.  The  following  list  shows  the  entire 
basic  Latin  alphabet,  and  all  special  characters  and  character  modifiers  that  are  used  in  these  romanization 
tables.  Each  unique  character  is  presented  with  its  graphic,  hexadecimal  code,  and  name.  More 
information  on  the  character  sets  used  in  USMARC  records  can  be  found  in  USMARC  Specifications  for 
Record  Structure,  Character  Sets,  and  Exchange  Media  (Washington:  Library  of  Congress,  CDS,  1994). 


Graphic 

Hex  Code 

Name 

Graphic 

Hex  Code  Name 

20 

SPACE  (BLANK) 

? 

3F 

QUESTION  MARK 

! 

21 

EXCLAMATION  MARK 

@ 

40 

COMMERCIAL  AT 

if 

22 

QUOTATION  MARK 

A 

41 

A 

# 

23 

NUMBER  SIGN 

B 

42 

B 

$ 

24 

DOLLAR  SIGN 

C 

43 

C 

% 

25 

PERCENT  SIGN 

D 

44 

D 

& 

26 

AMPERSAND 

E 

45 

E 

9 

27 

APOSTROPHE 

F 

46 

F 

( 

28 

OPENING  PARENTHESIS 

G 

47 

G 

) 

29 

CLOSING  PARENTHESIS 

H 

48 

H 

* 

2A 

ASTERISK 

I 

49 

I 

+ 

2B 

PLUS  SIGN 

J 

4A 

J 

» 

2  C 

COMMA 

K 

4B 

K 

- 

2D 

HYPHEN-MINUS 

L 

4C 

L 

2E 

PERIOD  (DECIMAL  POINT) 

M 

4D 

M 

/ 

2F 

SLASH 

N 

4E 

N 

0 

30 

0 

0 

4F 

0 

1 

31 

1 

P 

50 

P 

2 

32 

2 

Q 

51 

Q 

3 

33 

3 

R 

52 

R 

4 

34 

4 

S 

53 

S 

5 

35 

5 

T 

54 

T 

6 

36 

6 

U 

55 

U 

7 

37 

7 

V 

56 

V 

8 

38 

8 

w 

57 

w 

9 

39 

9 

X 

58 

X 

• 

3A 

COLON 

Y 

59 

Y 

9 

3B 

SEMICOLON 

Z 

5A 

Z 

< 

3 C 

LESS  THAN  SIGN 

[ 

5B 

OPENING  SQUARE  BRACKET 

= 

3D 

EQUAL  SIGN 

\ 

5C 

REVERSE  SLASH 

> 

3E 

GREATER  THAN  SIGN 

] 

5D 

CLOSING  SQUARE  BRACKET 
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Graphic 

Hex  Code 

Name 

\ 

5C 

REVERSE  SLASH 

[ 

5D 

OPENING  SQUARE  BRACKET 

A. 

5E 

SPACING  CIRCUMFLEX 

5F 

SPACING  UNDERSCORE 

4 

60 

SPACING  GRAVE 

a 

61 

d 

b 

62 

b 

c 

63 

c 

d 

64 

d 

e 

65 

e 

f 

66 

f 

g 

67 

g 

h 

68 

h 

i 

69 

i 

j 

6A 

j 

k 

6B 

k 

1 

6C 

1 

m 

6D 

m 

n 

6E 

n 

0 

6F 

o 

P 

70 

P 

q 

71 

q 

r 

72 

r 

s 

73 

t 

t 

74 

t 

u 

75 

u 

V 

76 

V 

w 

77 

w 

X 

78 

X 

y 

79 

y 

z 

7A 

z 

{ 

7B 

OPENING  CURLY  BRACE 

7C 

VERTICAL  BAR  (FILL) 

} 

7D 

CLOSING  CURLY  BRACE 

~ 

7E 

SPACING  TILDE 

t 

A1 

UPPERCASE  POLISH  L 

0 

A2 

UPPERCASE  SCANDINAVIAN  0 

D 

A3 

UPPSERCASE  D  WITH  CROSSBAR 

!> 

A4 

UPPERCASE  ICELANDIC  THORN 

A 

A5 

UPPERCASE  DIGRAPH  AE 

CE 

A6 

UPPERCASE  DIGRAPH  OE 

/ 

A7 

SOFT  SIGN  (PRIME) 

• 

A8 

DOT  IN  MIDDLE  OF  LINE 

\> 

A9 

MUSICAL  FLAT 

® 

AA 

SUBSCRIPT  PATENT  MARK 

± 

AB 

PLUS  OR  MINUS 

O 

AC 

UPPERCASE  O-HOOK 

u 

AD 

UPPERCASE  U-HOOK 

5 

AE 

ALIF 

Graphic 

Hex  Code 

Name 

C 

BO 

AYN 

i 

B1 

LOWERCASE  POLISH  L 

0 

B2 

LOWERCASE  SCANDINAVIAN  0 

d 

B3 

LOWERCASE  D  WITH  CROSSBAR 

1> 

B4 

LOWERCASE  ICELANDIC  THORN 

as 

B5 

LOWERCASE  DIGRAPH  AE 

oe 

B6 

LOWERCASE  DIGRAPH  OE 

// 

B7 

HARD  SION  (DOUBLE  PRIME) 

l 

B8 

LOWERCASE  TURKISH  I 

£ 

B9 

BRITISH  POUND  SIGN 

5 

BA 

ETH 

O' 

BC 

LOWERCASE  O-HOOK 

tf 

BD 

LOWERCASE  U-HOOK 

0 

CO 

DEGREE  SIGN 

l 

Cl 

LOWERCASE  SCRIPT  L 

® 

C2 

PHONO  COPYRIGHT  MARK 

© 

C3 

COPYRIGHT  MARK 

C4 

SHARP 

i 

C5 

INVERTED  QUESTION  MARK 

l 

C6 

INVERTED  EXCLAMATION  MARK 

) 

BO 

PSEUDO  QUESTION  MARC 

h 

El 

GRAVE 

6 

E2 

ACUTE 

□ 

E3 

CIRCUMFLEX 

0 

E4 

TILDE 

□ 

E5 

MACRON 

□ 

E6 

BREVE 

E7 

SUPERIOD  DOT 

□ 

E8 

UMLAUT  (DIERESIS) 

Q 

E9 

HACEK  (CARON) 

a 

EA 

CIRCLE  ABOVE  (ANGSTROM) 

S 

EB 

LIGATURE,  FIRST  HALF 

9 

EC 

LIGATURE,  SECOND  HALF 

d 

ED 

HIGH  COMMA,  OFF  CENTER 

u 

EE 

DOUBLE  ACUTE 

a 

EF 

CANDRABINDU 

Q 

FO 

CEDILLA 

Q 

FI 

RIGHT  HOOK  (OGONEK) 

□ 

F2 

DOT  BELOW 

6 

F3 

DOUBLE  DOT  BELOW 

0 

F4 

CIRCLE  BELOW 

Q 

F5 

DOUBLE  UNDERSCORE 

□ 

F6 

UNDERSCORE 

P 

F7 

LEFT  HOOK  (COMMA  BELOW) 

P 

F8 

RIGHT  CEDILLA 

0 

F9 

UPADHMANIYA 

u 

FA 

DOUBLE  TILDE,  FIRST  HALF 

13 

FB 

DOUBLE  TILDE,  SECOND  HALF 

a 

FE 

HIGH  COMMA,  CENTERED 
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Index  to  Languages 


This  index  lists  the  names  of  languages  covered  by  romanization  schemes  in  this  publication.  After 
each  name  is  an  indication  of  where  the  table  for  that  language  can  be  found.  Some  tables  are  applied 
to  more  than  one  language  (e.g.,  the  table  for  Hindi  is  used  also  for  Awadhi,  Bihari,  Braj,  Kashmiri  in 
Devanagari  script,  Maithili,  Pahari,  and  Rajasthani).  In  these  cases,  an  instruction  to  use  a  specific 
romanization  table  is  given.  A  table  is  generally  named  for  the  language  for  which  it  was  first  developed. 
See  references  from  alternate  names  and  spellings  of  languages  have  been  included  in  the  index  when  they 
differ  significantly  from  the  preferred  form. 


Language  page 

Abaza  see  Non-Slavic  Languages  .  139 

Abkhaz  see  Non-Slavic  Languages  .  139 

Adygei  see  Non-Slavic  Languages  .  139 

Aisor  see  Non-Slavic  Languages  .  139 

Altai  see  Non-Slavic  Languages  .  139-140 

Amharic  . 8 

Arabic .  10 

Argobba  see  Amharic  . 8 

Armenian  . 20 

Assamese . 22 

Avar  see  Non-Slavic  Languages  .  140 

Awadhi  use  Hindi  . 70 

Azerbaijani  (in  Cyrillic  script)  see  Non-Slavic  Languages .  140 

Azerbaijani  (in  Arabic  script) . 24 

Balinese . 30 

Balkar  see  Non-Slavic  Languages  .  140 

Bashkir  see  Non-Slavic  Languages  .  140 

Belorussian  . 34 

Bengali  . 36 

Bhutan  see  Tibetan .  216 

Bihari  use  Hindi  . 70 

Braj  use  Hindi  . 70 

Bulgarian . 38 

Burmese  . 40 

Buryat  see  Non-Slavic  Languages  .  141 

Byelorussian  see  Belorussian  . 34 

Cambodian  see  Khmer  . 96 

Chechen  see  Non-Slavic  Languages  .  141 

Chinese . 43 

Chukchi  see  Non-Slavic  Languages  .  142 

Church  Slavic . 56 

Chuvash  see  Non-Slavic  Languages  .  142 

Coptic  see  Greek . 64 

Croatian  see  Serbian  and  Macedonian .  190 
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Language  page 

Cyrillic  script  see  Belorussian,  Bulgarian,  Church  Slavic,  Non-Slavic 

Languages,  Russian,  Serbian  and  Macedonian,  or  Ukrainian 

Dargwa  see  Non-Slavic  Languages  .  142 

Devanagari  script  see  Bengali,  Hindi,  Marathi,  or  Sanskrit  and  Prakrit 

Divehi . 59 

Dogri  use  Panjabi .  169 

Dungan  see  Non-Slavic  Languages  .  143 

Eskimo  see  Non-Slavic  Languages  .  143 

Even  see  Non-Slavic  Languages  .  143 

Evenki  see  Non-Slavic  Languages  .  143 

Farsi  see  Persian  .  171 

Gagauz  see  Non-Slavic  Languages  .  143 

Galig  see  Mongolian .  132 

Ge’ez  see  Amharic  . 8 

Georgian . 62 

Gilyak  see  Non-Slavic  Languages— Nivkh  .  152 

Gipsy  see  Non-Slavic  Languages— Romany .  152 

Greek . 64 

Gujarati  . 66 

Gurage  see  Amharic  . 8 

Gurmukhi  script  see  Panjabi  .  169 

Hebrew . 68 

Hindi  . 70 

Ingush  see  Non-Slavic  Languages  .  143 

Inuit  see  Non-Slavic  Languages— Eskimo .  143 

Japanese  . 73 

Javanese  . 86 

Kabardian  see  Non-Slavic  Languages  .  144 

Kalmyk  see  Non-Slavic  Languages  .  144 

Kannada . 90 

Karachay  see  Non-Slavic  Languages  .  144 

Karachay-Balkar  see  Non-Slavic  Languages  .  144 

Karakalpak  see  Non-Slavic  Languages  .  144-145 

Karelian  see  Non-Slavic  Languages  .  145 

Kashmiri . 92 

Kazakh  see  Non-Slavic  Languages  .  145 

Khakass  see  Non-Slavic  Languages  .  145-145 

Khanty  see  Non-Slavic  Languages  .  146-147 

Khmer  . 96 

Kirghiz  see  Non-Slavic  Languages  .  147 

Komi  see  Non-Slavic  Languages  .  147 

Komi-Permyak  see  Non-Slavic  Languages .  147 

Konkani  (in  Kannada  script)  see  Kannada . 90 

Konkani  (in  Devanagari  script)  see  Marathi  .  130 

Korean  . 99 

Koryak  see  Non-Slavic  Languages  .  148 
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Kumyk  see  Non-Slavic  Languages  .  148 

Kurdish  (in  Perso-Arabic  script)  .  114 

Lahnda  use  Panjabi .  169 

Lak  see  Non-Slavic  Languages  .  148-149 

Lao  .  116 

Lapp  see  Non-Slavic  Languages  .  149 

Lepcha  .  120 

Lezghian  see  Non-Slavic  Languages  .  149 

Limbu .  122 

Lithuanian  (n  Cyrillic  script)  see  Non-Slavic  Languages  .  150 

Macedonian  see  Serbian  and  Macedonian .  190 

Madurese  see  Javanese  . 86 

Maithili  use  Hindi  . 70 

Malay .  124 

Malayalam .  128 

Maldivian  see  Divehi . 59 

Manipuri  use  Bengali . 36 

Mansi  see  Non-Slavic  Languages  .  150 

Marathi .  130 

Mari  see  Non-Slavic  Languages  .  150-151 

Moldavian  see  Non-Slavic  Languages  .  151 

Molodtsov  see  Non-Slavic  Languages—Komi .  147 

Mongolian  (in  Cyrillic  script)  see  Non-Slavic  Languages  .  151 

Mongolian  (in  Vertical  script) .  132 

Moplah . 134 

Mordvin  see  Non-Slavic  Languages  .  151 

Nanai  see  Non-Slavic  Languages  .  151 

Nenets  see  Non-Slavic  Languages  .  152 

Nepali  use  Hindi  . 70 

Newari  use  Hindi  . 70 

Nivkh  see  Non-Slavic  Languages  .  152 

Nogai  see  Non-Slavic  Languages  .  152 

Non-Slavic  Languages  (in  Cyrillic  script) .  138 

Oriya  .  156 

Ossetic  see  Non-Slavic  Languages  .  152 

Ottoman  Turkish  .  158 

Pahari  use  Hindi  . 70 

Pali  (in  various  scripts)  . 164 

Panjabi .  152 

Permyak  see  Non-Slavic  Languages— Komi-Permyak  .  169 

Persian .  171 

Prakrit  see  Sanskrit  and  Prakrit  .  186 

Punjabi  see  Panjabi .  169 

Pushto .  178 

Rajasthani  use  Hindi  . 70 

Romanian  (in  Cyrillic  script)  see  Non-Slavic  Languages-Moldavian  .  151 
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Romany  see  Non-Slavic  Languages  .  152 

Russian .  184 

Sanskrit . ^  T .  186 

Santali  .  188 

Selkup  see  Non-Slavic  Languages  .  152 

Serbian .  190 

Shor  see  Non-Slavic  Languages  .  152 

Sindhi . 192 

Sinhalese .  194 

Sonthal  see  Santali  .  188 

Sundanese  see  Javanese  . 86 

Syriac  (in  Cyrillic  script)  see  Non-Slavic  Languages— Aisor .  139 

Syriac  (in  Hebrew  script)  see  Hebrew  and  Yiddish . 68 

Tabasaran  see  Non-Slavic  Languages  .  153 

Tajik  see  Non-Slavic  Languages  .  153 

Tamil  .  196 

Tat  see  Non-Slavic  Languages  .  153 

Tatar  see  Non-Slavic  Languages  .  153 

Tegulu  .  198 

Thai .  200 

Tibetan .  216 

Tigre  see  Amharic  . 8 

Tigrinya  .  218 

Tua  Tham  see  Pali .  164 

Turkish  see  Ottoman  Turkish  .  158 

Turkmen  see  Non-Slavic  Languages  . .  153 

Tuva  see  Non-Slavic  Languages  .  154 

Udekhe  see  Non-Slavic  Languages  .  154 

Udmurt  see  Non-Slavic  Languages  .  154 

Uighur  (in  Arabic  script) .  220 

Uighur  (in  Cyrillic  script)  see  Non-Slavic  Languages .  154 

Ukrainian  .  224 

Urdu .  226 

Uzbek  see  Non-Slavic  Languages  .  154 

White  Russian  see  Belorussian  . 34 

Xanty  see  Non-Slavic  Languages-Khanty .  146-147 

Yakut  see  Non-Slavic  Languages  .  154 

Yiddish  see  Hebrew  and  Yiddish  . 68 

Yuit  see  Non-Slavic  Languages— Eskimo .  143 
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